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ABSTRACT 

Policies and practices related to grading at 14 
colleges and universities (8 public and 6 private) and how they have 
changed between 1980 and 1990 were studied along with the grading 
orientations and practices of over 500 faculty members. Changes 
between 1980 and 1990 that may have affected grades include greater 
prescription of the curriculum, greater differentiation in grading 
systems, and increased use of student evaluations of faculty members. 
While none of the department chairpersons said there were specific 
departmental grading policies, about one quarter of the faculty felt 
that there was a policy of grading against specific standards, and 
about two-thirds said that their departments expected them to grade 
against specific standards, even though there was no defined policy. 
Some faculty members perceived grades as formal and objective, while 
others thought they could not be reduced to objective measures. Most 
believed that grades are higher than they used to be because faculty 
expects less of students today. Eighty-one percent of faculty said 
they sometimes used a criterion-referenced approach, but only 647e 
said they used it most often. Twenty-nine percent used a 
norm-referenced approach most often. Twenty-four tables present study 
findings. (Contains 91 references.) (SLD) 
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Abstract 

This study examined policies and practices related to 
grading at 14 colleges and universities and how they 
had changed between 1980 and 1990. It also examined 
the grading orientation and practices of over 500 fac- 
ulty members in the business, chemistry, education, 
English, history, mathematics, and psychology depart- 
ments of these institutions. 

The study sought information to answer four 
questions: 

1. What are the current institutional and/or de- 
partmental grading policies and practices? 

2. Were there changes in these policies and prac- 
tices between 1980 and 1990 and, if so, what 
was the nature of the changes? 

3. What are the current grading orientation and 
practices of faculty who teach undergraduate 
courses? Have these changed over time? 

4. Do faculty grading orientation and practices 
differ across departments and, if so, how? 

Information was collected from 14 colleges and 
universities, 8 public and 6 privately controlled. Within 
each institution, information was collected from depart- 
ment chairs and faculty in each of the designated de- 
partments that was established at that institution. Us- 
able questionnaire responses were received from 58 
department chairs (68 percent response rate) and from 
542 faculty members (25 perceni response rate). 

Institutional changes between 1980 and 1990 that 
may have affected grades included greater prescription 
of the curriculum, greater differentiation in grading sys- 
tems, and increased use of student evaluations of faculty 
members. 

While none of the department chairpersons said 
there were specific departmental grading policies, fac- 
ulty had a very different perspective. Approximately a 
quarter of the faculty said their department had a policy 
of grading against specific standards, 14 percent of 
these said there was a policy that all students be given 
an A or B in honors courses, and 1 1 percent said there 
was a policy that attendance should be a factor in 
course grades, Additionally, about two-thirds of the fac- 
ulty said that, although not specific policy, their depart- 
ment expected them to grade students against specific 
standards. About half of these faculty said their depart- 
ment expected them to grade students relative to the 
overall performance of the class. 

Only about a third of the department chairpersons 
reported having formal meetings to discuss grading, but 
informal meetings about grading were reported by 75 
percent. Fifty-six percent of the responding faculty said 



their department had tried to raise standards during the 
decade between 1980 and 1990. 

Faculty orientations toward grading could be cate- 
gorized into two approaches, one viewing grades as 
forma! and objective, the other insisting that grades 
cannot be reduced to a set of objective measures. There 
were significant differences across departments, with 
chemistry, mathematics, and psychology faculty more 
likely to subscribe to the "objective" view whili? Eng- 
lish, education, and history faculty tended to favor the 
"nonobjective" view. Faculty in business appeared to 
have diverse attitudes toward grading and could not be 
easily categorized into either group. 

Forty-three percent of the responding faculty said 
their grading philosophy had charged since they began 
to teach. Faculty believed that the meaning of a grade 
varies more across disciplines than across institutions 
and that the major reason why grades today are higher 
than they were 20 years ago is because faculty now ex- 
pect less of students. 

When asked about the purpose of grades, respond- 
ing faculry saw the primary purposes as providing feed- 
back n students, providing information about stu- 
dents to graduate or professional schools, motivating 
students to do good work, and helping the college or 
university make decisions about students. 

Faculty were asked if they ever used each of three 
grading approaches. Eighty-one percent said they some- 
times used a criterion-referenced approach, 5 T percent 
a norm-referenced approach, and 44 percent a self- 
referenced approach. When asked which one they used 
most often, 64 percent named the criterion-referenced 
approach, 29 percent the norm-referenced approach, 
and 8 percent the self-referenced approach. 

When asked about the importance they gave to var- 
ious components of grades when assigning grades in in- 
troductory and advanced courses, tests and quizzes 
were rated as most important for introductory courses, 
papers and written assignments for advanced courses. 
Other factors that took on increased importance in 
grading advanced courses included oral reports, cre- 
ativity, class participation, group projects, and subject- 
specific skills and techniques. There were many signifi- 
cant differences across departments in the importance 
assigned to different components of grades. 

Responding faculty were more likely to use essay 
than multiple-choice tests in their introductory courses, 
but again, there were significant differences by depart- 
ment. 
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Introduction 

It is common knowledge among members of the higher 
education community that grading practices lack uni- 
formity; grade distributions vary considerably among 
sections of the same course and across instructors, de- 
partments, and institutions, When Lewis, Dexter, and 
Smith (1978) studied the grading procedures of college 
English teachers, they found 10 different patterns of in- 
formation used to arrive at grades. Small wonder then 
that those analyzing grades have made comments such 
as "The unreliability of grades within departments and 
the variability of grading standards across departments 
is apparent to all who have looked into these matters" 
(Fishman 1958, 341). 

This report explores some of the factors involved in 
college grades. It describes a study of current under- 
graduate course grading policies and practices in 14 col- 
leges and universities. It also provides some insights into 
how these grading policies and practices may have 
changed between 1980 and 1990. 

Grading is clearly a complex issue. Grades may be 
aiicctcd by institutional and departmental policies as 
well as by the types of individuals doin-; the grading, the 
grading orientation and practices of Jicsc individuals, 
and the kinds of students who enroll in a given institu- 
tion or a given course. Most of the exiting research on 
grading has looked at only one or two of these factors. 
To get a more comprehensive picture, information was 
collected at the institutional, departmental, and indi- 
vidual faculty levels to determine how they interact and 
which appear to be primary influences. 

Background and Rationale 

Grades and grading practices are topics of widespread 
interest, as well as some controversy, in higher educa- 
tion. Underlying the discussion of grades has been a 
concern for standards, which at times have been con- 
sidered "too stringent" and at other times "too lax." 
For example, during the 1950s and 1960s there was 
concern over the increasing rigor of grading standards 
(sec Webb 1959; Aiken 1963; Hills 1964; Hills and 
Klock 1965; Miller 1965; Hills 1967; Hills and Gladncy 
1968; Wilson 1970). During the past two decades, es- 
pecially in the 1970s, the concern was the declining 
value of grades, or what has come to be known as 
"grade inflation" (sec Juola 1974; Etzioni 1975; Suslow 
1977; Bcjar and Blew 1981; Milton, Pollio, and Eison 
1986; Summcrvillc, Ridley, and Maris 1988). In addi- 
tion to the possibility that grade inflation may represent 
an erosion of academic standards or be a counteraction 



to the more stringent standards used in the 1950s and 
1960s, it may also represent real increases in student 
achievement - , demographic changes, or institutional 
grading policies that permit grade-point averages 
(GPAs) to rise even if grading standards remain un- 
changed (Birnbaum 1977). It is likely that, on average, 
grade inflation was a phenomenon of the 1970s; pair- 
wise matches of validity studies (Ramist and Weiss 
1990) revealed a huge grade inflation in the 1970s but 
none thereafter. 

Sabot and Wakeman-Linn (1991) reported evidence 
from nine colleges and universities that grade inflation 
during the past 25 years has resulted in an increasing 
disparity of grades and grading policies within institu- 
tions, splitting many into high- and low-grading depart- 
ments. The authors argue that these differences affect 
course choice and make it difficult to attract students to 
fields like mathematics and science in which grades tend 
to be lower than in the arts and humanities. Students 
use grades to determine if they should take further 
courses in a field and to make decisions about the 
choice of a major and/or a career (McKcachie 1986). 

Another aspect of the current concern about college 
grades is the question of their adequacy as an indicator 
of college achievement. Since the chief function of 
grades is to convey information about students, unex- 
plained changes and variations in grading policies raise 
questions about their validity. As Gcisinger (i982) 
pointed out in his review of marking systems, "nu- 
merous studies have demonstrated that individual 
teachers differ in the kinds of marks they assign and 
that these differences arc stable." Duke (1983) found 
differences in grading practices among the colleges of 
one university, among the various disciplines, and 
among the instructors. These differences have led some 
to question the almost universal use of grades in making 
educational admission and employment decisions. (See 
Hoyt 1966, 1970; Humphreys 1968; O'Leary, 1980.) 

A related concern about grades centers on their re- 
lationship to test scores and their predictability, espe- 
cially as used in college and graduate school admission. 
Various grade adjustment methods have been em- 
ployed; for a summary of these see Young (1993). 
Lewis, Dexter, and Smith (1978) found that grades 
from English teachers who relied on each of several dif- 
ferent sets of factors (marking patterns) in assigning 
grades correlated differently with test scores. In consid- 
ering factors that might influence the relationship be- 
tween test scores and freshman grades, Willingham 
(1990a) listed the following variations that might be ob- 
served in the criterion (grades): 

• characteristics of grades (courses included, 
grading practices); 
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• meaning of grades (skills or type of performance 
evaluated); 

• grading variations (due to different standards or 
performances); and 

• educational practices (remediation, placement). 

Willingham suggested a variety of factors that may 
have altered the predictability of the freshman GPA in- 
cluding: rigor of the grading process, grade inflation, 
variation in grading standards across courses, the inci- 
dence of remedial and advanced course work, diversity 
in the curriculum, changes in what is rewarded in 
grading, changes in the competencies required of stu- 
dents, and changes in program enrollment. 

The predictability of college grades is also related to 
institutional characteristics (Baird 1983). Although 
some have argued that grades arc less predictable in in- 
stitutions that enroll students with a wide range of aca- 
demic ability and in institutions that have a very diverse 
curriculum, correlations between freshman grade-point 
average (FGPA) and standard predictors tend to be 
higher at institutions that enroll students with a wide 
range of academic ability because there is less restriction 
of range. After correcting for such restriction there is 
some evidence of very slightly reduced correlations at 
such institutions (Ramist and Weiss 1990). Sabot and 
Wakeman-Linn (1991) found considerable differences 
across departments in the extent to which grades in in- 
troductory courses could be predicted by other indica- 
tors, including ability measures such as the SAT. They 
found much better prediction of grades in the low- 
grading departments than in the high-grading depart- 
ments. There have also been concerns about bias asso- 
ciated with gender and race in the prediction of college 
performance. (See, for example, Hogrebe et ah 1983; 
Elliott and Strenta 1988; McComack and McLcod 
1988.) 

The discussion now turns to a consideration of how 
college grades may be affected by: (i) institutional poli- 
cies and practices; (2) variations in grading by depart- 
ment or discipline; (3) faculty attitudes about grading 
and faculty grading practices; and (4) changes in student 
achievement level and in the perceived importance of 
grades. 

Institutional Policies and 
Practices 

Students' CPAs may be subject to influence by a variety 
of institutional policies and practices, including admis- 
sion policies, required courses, how course grades are 
recorded, which courses and grades arc included in the 



GPA, the extent to which part-time faculty are used, 
and the way that student course evaluations are used. It 
is possible that institutional changes in these areas over 
the past two decades have affected the GPA. A survey of 
changes in grading practices between 1973-74 and 
1978-79 (Collins and Nickel 1979) found that policies 
were more stringent in 1978-79 concerning the use of 
nontraditional grading practices, makeup of incom- 
pletes, withdrawal policies, and policies related to re- 
maining in good standing, being placed on probation, 
or being suspended. But there were also more institu- 
tions reporting unlimited opportunities to repeat 
courses, not recording grades from a "bad" semester, 
and using the highest grade earned in a course to calcu- 
late the GPA. Unfortunately this area has received rela- 
tively little recent research attention. 

Changing admission standards at some colleges and 
universities during the past two decades may h?ve af- 
fected GPAs. Students who were admitted under lower 
standards may have been more likely to avoid courses 
known to have strict grading standards (Goldman et al. 
1974). Supporting this argument, Prather, Smith, and 
Kodras (1979) stated that rising cumulative GPAs are 
not caused by a lowering of grading standards in indi- 
vidual courses; rather, they are the result of changes in 
student course enrollment patterns over time. Specifi- 
cally, students are moving away from traditional 
courses, especially those known to be stiffly graded, to 
newer degree programs with more lenient grading stan- 
dards. Hills (1964) found that when one liberal arts col- 
lege raised its admission standards, students of equal 
academic ability (as indicated by their admission cre- 
dentials) then tended to receive lower grades. He hy- 
pothesized that this was due to a faculty commitment to 
assign grades to a class on a specified percentage basis, 
i.e., 15 percent A\s, 40 percent B's, etc. 

Some attention has been given to the impact of cur- 
ricuiar changes on college grades. Studies comparing 
general education requirements across time provide 
some insights into trends. In the 1960s, general educa- 
tion requirements represented 43 percent of the total de- 
gree requirements (Dressel and DiLisle 1969). By the 
early 1970s this had dropped to 33 percent (Blackburn 
et al. 1976), but it rose to 38 percent by the late 1980s 
(Toombs and Fairweather 1989). In their extensive na- 
tional survey of undergraduate institutions, Blackburn 
and his colleagues (1976) identified a radical change 
away from the traditional core curriculum toward more 
specialized and narrow fields of study. In more recent 
years, there appears to have been an increase in general 
education requirements, most notably in mathematics, 
but also in other areas. Approximately 30 percent of 
colleges and universities had a mathematics general ed- 
ucation requirement in the 1960s. This declined to 20 
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percent in the 1970s. However, Lewis and Harris (1989) 
found that 46 percent of four-year colleges and univer- 
sities had such a requirement in 1983 and 59 percent in 
1988. Toombs and Fairweather (1989) reported that 65 
percent of the institutions they surveyed had such a re- 
quirement in 1988. Eliminating core curriculum re- 
quirements, as occurred during the 1970s, has been 
linked to inflated cumulative CPAs (Suddarth 1975; 
Potter 1979). However, the effect of the more recent 
changes in general education requirements is unknown. 
The Lewis and Harris study found that general educa- 
tion requirements represented a larger proportion of 
total degree requirements at comprehensive and liberal 
arts institutions than at research and doctoral institu- 
tions (using the Carnegie classifications). 

Nontraditional courses compound the impact of 
curriculum change. Carney, Isakson, and Ellsworth 
U978) found that the increased use of field experiences, 
internships, and other nontraditional course formats as- 
sociated with new areas of specialization contributed 
even further to inflated cumulative CPAs because stu- 
dents generally received higher grades in such courses. 
Stein and Guthrie (1990) also found that students re- 
ceived higher grades in internship and field experience 
courses than in regular courses and those emphasizing 
laboratory work. 

The type(s) of grade reporting system used by an in- 
stitution appears to affect GPA. There is evidence that 
students in pass/fail courses would have earned lower 
grades had they been graded on the traditional basis 
(Fclder 1979; Geisinger and Rabinowitz 1979; Potter 
1979). Increasing the degree of differentiation in a 
grading system, such as by adding pluses and minuses to 
letter grades, may inflate the GPA. For example, Potter 
(1979) found that adding pluses and minuses to tradi- 
tional letter grades resulted in higher grades than when 
the traditional five-letter system was used alone. 

Institutional policies regarding how the GPA is 
computed will, of course, have an impact on grading 
outcomes. Research in this area has focused primarily 
on policies related to student withdrawal from courses 
and use of incompletes. Potter (1979) found that liber- 
alized incomplete and withdrawal policies at one insti- 
tution resulted in dramatically higher cumulative CPAs. 
Geisinger and Rabinowitz (1979) reported that students 
tended to withdraw from courses when their grades 
were low in order to protect their GPAs. A longitudinal 
study of policies concerning mrking up incompletes and 
withdrawals from courses foui.J that the policies were 
slightly more strict in 1978 than they had been five 
years earlier (Collins and Nickel 1979); this trend, the 
researchers concluded, would help reduce grade infla- 
tion. 



Duke concluded that, among the top students grad- 
uating from one institution in 1978. several had 
obtained their high standing because of administra- 
tive rules related to transfer credits, readmission, 
pass/fail courses, and repeated courses, which had a 
confounding impact on their GPAs. "The net result was 
that the top 19 students attained their high GPAs by de- 
cidedly nonequivalent accounting procedures. Hence, 
the resulting rankings were limited in reliability and va- 
lidity and led to inequitable comparisons" (1983, 
1049). 

Institutions and departments vary in their policies 
and practices concerning the use of part-time faculty. Of 
late the use of part-time faculty has been increasing. 
This may be a response to the rapid growth in some dis- 
ciplines. It is also often a means of dealing with tight 
budgets. One recent national estimate is that 35 to 40 
percent of faculty hold part-time appointments 
(Mangan 1991). Prather (1976) found that part-time 
faculty tended to give higher grades than full-time fac- 
ulty. One hypothesis is thac part-rime faculty are not 
sufficiently integrated into departmental activities to 
learn the grading norms used by their full-time col- 
leagues. Another possibility is that part-time faculty es- 
pecially in applied fields such as business, may empha- 
size competencies valued more highly in the workplace 
than in academe. 

Institutional and departmental use of students' 
course evaluations, especially when they are considered 
in faculty retention, tenure, and promotion decisions, 
may also be exerting upward pressure on grades. Fac- 
ulty who give higher grades tend to receive more favor- 
able evaluations from their students (Carney et al. 1 978; 
Longstreth 1979; Johnson and Beck 1988; Rasbow and 
Hernandez 1988; Zangenehzadeh 1988; Nirnmer and 
Stone 1 99 1 ). This suggests that, if faculty are aware that 
student evaluations will be used to make decisions 
about retention, promotion, tenure, and salary, they 
may be afraid to grade too harshly lest they receive un- 
favorable reviews and jeopardize their careers. We 
would anticipate that faculty concern about student 
evaluations would increase in periods when college fac- 
ulty have difficulty finding and keeping jobs. The value 
of student ratings as well as their potential for misuse 
has been discussed by Alcamoni (1987). 

Still another institutional factor that may have af- 
fected grades in recent years is an administrative effort 
to deal with perceived or actual grade inflation. Anec- 
dotal evidence suggests that some college administrators 
may be reviewing average grades in courses and calling 
faculty to task if their grades are noticeably higher than 
average. Other administrators are reported to be im- 
posing practices such as "grading on the curve* 1 as a 

10 



means of combating grade inflation or to create the ap- 
pearance of high institutional standards. 

Variations in Gradifig by 
Department or Discipline 

One of the most consistent findings in the literature on 
grading is the variation from department to department 
or discipline to discipline (sec Juoia 1968; Goldman, 
Schmidt, Hewitt, and Fischer 1974; Goldman and 
Slaughter 1 976; Prather, Smith, and Kodras 1 979; Duke 
1983; Willingham 1985; Strcnta, and Elliott 1987; 
Summerville, Ridley, and Maris 1988). The same disci- 
plines tend to be characterized by high or low grades 
when comparisons are made across institutions (Elliott 
and Strenta 1988; Summerville, Ridley, and Maris 
1988; Sabot and Wakeman-Linn 1991). There is less 
agreement about whether or not these differences have 
changed over time. Duke (1983) reported thai they ap- 
peared to be stable across time but Sabot and 
Wakeman-Linn (1991) found increasing divergence. 
College grades tend to be lower in the physical and bio- 
logical sciences and higher in education and in the arts 
and humanities (Duke 1983; Boli, Katchadourian, and 
Mahoney 1984; Wiliingham 1990b; Sabot and 
Wakeman-Linn 1991). 

This has become a matter for concern since students 
with comparable abilities may end up with widely dif- 
fering GPAs depending on the mix of courses they take. 
Several studies have suggested adjusting, or changing 
the use of GPAs to take these departmental differences 
into consideration. For example, Duke (1983) proposed 
use of centile grades to provide a measure of relative 
achievement. Elliott and Strenta (1988) have shown 
that the use of an index of differential grading when 
predicting GPA not only improves overall GPA predic- 
tion but also improves prediction for minority students 
and reduces apparent undcrprediction for women. 
However, McComack and McLeod (1988) reported 
that in most of the large introductory courses they 
studied, no gender bias was found that would hold up 
through cross-validation in another semester. 

One hypothesis to explain departmental differences 
in grading patterns is the "two cultures" phenomenon, 
whereby different standards apply in the sciences and 
other fields that emphasize knowledge of specific fac- 
tors and cumulative course content than apply in the 
humanities. These differences may also be related to the 
types of assessment used (Oh 1976; Kodras and Prather 
1978; Chase and Wakefield 1984). Faculty in more 
data-oriented disciplines, such as the physical sciences, 



tend to emphasize objective tests and quizzes and are 
less willing to adjust their grading standards to accom- 
modate individual student characteristics. In contrast, 
faculty in the humanities, fine arts, and education are 
less likely to use objective tests and more likely to con- 
sider factors such as effort and attitude when they as- 
sign grades. 

Etzioni (1975) has theorized that views of knowl- 
edge, held both within disciplines and by individual fac- 
ulty members, profoundly influence grading practices. 
According to Etzioni, faculty members who see knowl- 
edge as a formal and objective system tend to have 
higher grading standards than faculty who view knowl- 
edge as socially constructed and relative. Using this ar- 
gument, Etzioni attributed increases in GPA during the 
1970s to a broader acceptance of the socially con- 
structed and relative view of knowledge on the part of 
college faculty. EtzioniV; (1961) organizational theories 
have been used by Bromley, Busching, Oliver, and 
Szozda to compare grading in normative departments 
(those in which faculty are concerned with reaching out 
to students and transforming them) with grading in util- 
itarian departments (those that emphasize presentation 
of the subject matter). They found significant differ- 
ences in grading practices and argued that attempts to 
standardize grades across disciplines "may force unifor- 
mity which thwarts legitimate diversity in the academic 
goals of various departments" (1981. 434). 

A closely related finding is the significant difference 
in GPAs from school to school within larger institutions 
(Duke 1983; Stein and Guthrie 1990). This may be re- 
lated to differences in admission policies, academic 
standards, departmental policies, and/or required 
courses across schools. 

Class size and course level may also be related to 
grading differences, both across and within depart- 
ments. Grades appear to be inversely related to class 
size (Boli, Katchadourian, and Mahoney 1984; Dickson 
1984; Stein and Guthrie 1990), with the effect strongest 
in the freshman year, according to Dickson. Stein and 
Guthrie argued that smaller classes permit higher 
quality instruction, including better evaluation prac- 
tices. Gcisingcr and Rabinowitz (1979) provided a 
somewhat different perspective on the relationship be- 
tween class size and grades. They found that with larger 
cl.isses faculty preferred to use a norm-referenced ap- 
proach to grading, which usually yields lower grades 
than when factors such as effort are included. One ex- 
planation may be that larger classes do not lend them- 
selves to high leveis of faculty-student contact, thereby 
making it more difficult to evaluate each student's indi- 
vidual progress in the course. Another explanation may 
be thai the larger classes tend to be introductory-level 
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courses or courses required of all students, while the 
smaller classes tend to be higher-level courses taken by 
more advanced students. Stein and Guthrie (1990) 
found a significant relationship between course level 
and mean CPA, with higher grades in the more ad- 
vanced courses. They argued that the distribution of 
grades should be expected to differ from freshman 
courses to senior courses, Bogart and Kistler (1987) 
found that English composition instructors in both 
community colleges and state universities tended to 
apply and to give equal importance to the same criteria 
for grading in normal situations; this suggests that dis- 
ciplinary differences transcend institutional type. 

Faculty Grading Practices 

A number of explanations have been offered for the 
variability of faculty grading practices. One argument is 
that professors have different philosophies regarding as- 
sessment that affect their grading practices. Drcsscl 
(1961) has identified three types of faculty members, 
based on their grading practices and philosophies: (1) 
traditionalists, who emphasize formal, objective mea- 
surement of knowledge and arc critical of lax grading 
standards; (2) eclectics, who feel that, because the col- 
lege experience is pluralistic, giading standards must 
recognize differing student backgrounds and interests; 
and (3) relativists, who view education as a contiruing 
process of improvement and feel that grading interferes 
with actual learning. Etzioni (1975) has commented 
that declining standards may result when there is an at- 
tempt to combine these different approaches to grading 
into one system. 

Geisinger and Rabinowitz (1979), building on the 
work of Thorndikc (1969), identified three orientations 
toward grading: (1) norm-referenced, in which faculty 
evaluate students relative to the performance of others 
in the class; (2) criterion-referenced, in which students 
arc evaluated in terms of previously set standards; and 
(3) self-referenced, in which faculty evaluate each stu- 
dent relative to his or her abilities, motivation, and/or 
past performance. In the early 1960s, according to 
Geisinger and Rabinowitz, norm-referenced assessment 
was common but there was a shift away from it in the 
middle to late 1960s toward other orientations, espe- 
cially the self-referenced orientation. In the middle to 
late 1970s faculty appear to have favored the criterion- 
referenced orientation (Juola 1974; Carney, Isakson, 
and Ellsworth 1978; Bcllott 1981). This was evident in 
the use of mastery models of learning that emphasized 
objective-based instruction, learning modules, and per- 
formance contracts. Geisinger and Rabinowitz found 
that faculty with a norm-referenced orientation gave 
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lower grades than those with other approaches to 
grading. These findings suggest that rising CPAs may be 
the product of changes in faculty orientations toward 
grading. Faculty grading orientation appears to vary 
across types of institutions. In a study comparing atti- 
tudes at a university, a four-year college, and a two-year 
college, Geisinger and his colleagues (1980) found that 
university faculty were more norm-referenced than col- 
lege faculty, and that community college faculty were 
most likely to advocate the self-referenced approach to 
grading. As Hanna and Cashin (1988) have pointed 
out, both norm-referenced (class curve) and criterion- 
referenced (percentage) grading systems lack a sound 
rationale; these authors recommend the use of anchor 
measures. 

Raths, Wojtaszek-Healy, and Della-Piana (1987) 
concluded that there arc two principal approaches to 
grading: (1) translating the grading problem into in 
arithmetic algorithm, giving points for subunits, 
weighing them according to the instructor's view of 
their importance, and establishing cutoffs for specific 
grades; and (2) using a more intuitive and less pre- 
dictable approach, such as assigning grades based on a 
general impression of how well the work meets the in- 
structor's expectations. 

Ory and Ryan (1993) provided an excellent discus- 
sion of the advantages and disadvantages of using nor- 
mative versus absolute grading methods. They argued 
that grading on the basis of improvement may be un- 
fair; they consider grading on the curve, the distribution 
gap method, and percentage grading to be less defen- 
sible than either normative or absolute grading 
methods. 

Faculty in different departments may prefer dif- 
ferent grading orientations because they have different 
views about the purposes of grading. A survey of faculty 
at a state university (Hambleton and Murray 1977) 
found that grading was given the highest importance as 
a means of providing students with feedback on their 
course performance, followed in importance as a means 
of informing others about student performance and, 
then, as a means of motivating students to do good 
work. Faculty in the education department placed less 
importance on grading as a means of motivating stu- 
dents than faculty in other departments. Most faculty 
surveyed felt that pass/fail and pass/not recorded 
grading systems failed to meet the most important pur- 
poses of grading. Criterion-referenced grading was pre- 
ferred over norm-referenced grading for meeting these 
purposes. These faculty believed that the most impor- 
tant course outcomes were to maximize the amount of 
learning, to maximize student enthusiasm, and to max- 
imize student performance. They saw criterion-refer- 
enced grading as most likely to bring about thc.ie out- 
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comes. Similar studies have been done at other institu- 
tions. For example, Haughey 0977) found that the fac- 
ulty at a state college believed that grades were an im- 
portant motivator for students and that standards 
should be the chief criterion in determining grades. 

Other researchers have theorized that faculty adapt 
their grading standards to the ability level of their stu- 
dents (Aiken 1963; Hills and Gladney 1968; Goldman, 
Schmidt, Hewitt, and Fisher 1974; Goldman and 
Slaughter 1976; Gallini '2). That is, when student 
ability increases, faculty tend to use stricter grading 
standards and, when student ability decreases, grading 
standards are lowered. Adaptation-level theory has 
been used to relate this to differences in department 
grading standards. The argument has been made that 
lower-ability students migrate toward courses and fields 
that have lower grading standards. This, in turn, leads 
to higher expectations of students who take the more 
demanding courses as faculty adapt to the high ability 
of the class and grade more rigorously. Adaptation-level 
theory has also been used to explain grade inflation. For 
example, McKenzie and Tullock (1981) argued that as 
college admission standards drop and the quality of stu- 
dents decreases, faculty adapt by graJing more le- 
niently. Elliotr (personal communication) has found 
that, at a variety of institutions, prospective science ma- 
jors have higher mean scores on college admission tests 
than the average for all students entering the institution. 
This, along with adaptation theory, might explain the 
higher grading standards in science departments. Sum- 
merville and his colleagues (1988) concluded that de- 
partments do not award high grades because they at- 
tract high-achieving students. Their findings, that 
students in departments that traditionally assigned low 
grades received higher grades in their other courses, is 
consistent with adaptation theory. 

Faculty may give greater or lesser emphasis to dif- 
ferent grading criteria depending on the situation. 
Bogart and Kistler (1987) found no difference, overall, 
in the grading criteria used by English department fac- 
ulty in community colleges and state universities nor in 
the importance they assigned these criteria in normal 
situations. In borderline situations, however, commu- 
nity college faculty gave significantly more importance 
to "final exam scores" and university faculty gave sig- 
nificantly more importance to "adherence to due dates 
and deadlines" and to "improvement shown since the 
beginning of the course." 

Hughey and Harper (1983) found that a substantial 
portion of a course grade was the result of how the in- 
structor differentiated among students. The communi- 
cation responsiveness of the student (including things 
such as the kind of communication climate the studeti' 
creates, the way the student transmits information, and 



the way the student deals with communication barriers) 
had a "pervasive influence" on final course grade. Fe- 
males were more likely to get a high grade (gender ac- 
counted for 15.8 percent of the variance in grades). 

Faculty characteristics, such as rank, seniority, 
tenure status, and age, may influence grading practices. 
Prather (1976) found that graduate teaching assistants 
and instruct rs gave higher grades, on average, than 
other undergraduate faculty. This suggests that the 
growing use of part-time faculty may be related to grade 
inflation. Boozer (1977) however concluded that 
grading patterns are idiosyncratic; he found no rela- 
tionship between faculty rank, degree, tenure, and age 
and grades given. Dickson (1984) was also unable to 
support r'ne hypothesis that junior faculty mark more 
easily. 

In addition to varying at any one point in time, it is 
likely that faculty grading practices also vary over time. 
This is suggested by the increases in GPA observed since 
the late 1960s. These increases remain substantial even 
after controlling for the academic preparation of the 
students (Goldman, Schmidt, Hewitt, and Fisher 1974; 
Goldman and Slaughter 1976; Summerville, Ridley, and 
Maris 1988). Faculty may have experienced changes in 
attitudes or grading philosophies during the past two 
decades. After all, during this period the sorting func- 
tion of education has come under severe attack (Rist 
1970; Bowles and Gintis 1976; Oakes 1985). Faculty 
may have become increasingly sensitive to the role 
grades play in the sorting process and, as a result, re- 
sponded by grading more leniently. Also, the introduc- 
tion of student course evaluations and the greater use of 
part-time faculty during the past two decades may have 
contributed to grade inflation. It is also possible that 
faculty have modified their grading practices as a re- 
sponse to changes in the student body, including 
changes in academic preparation as well as changes in 
sex, race/ethnicity, and age. 

Changes in Student Achievement 
Level and in the Perceived 
Importance of Grades 

Grades may have risen because students now achieve at 
higher levels than did students in the past or because 
students are now more grade-oriented, thus making fac- 
ulty feel they should award higher grades. Johnson and 
Beck (1988) found that students with low SAT scores 
are more grade-oriented than students with higher SAT 
scores. 

In a survey by the Carnegie Foundation for the Ad- 
vancement of Teaching (1989), 70 percent of faculty 
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agreed with the statement "Undergraduates have be- 
come more grade conscious." However, 55 percent of 
faculty also agreed with the statement "Most under- 
graduates at my institution only do enough to get by." 
Agreement with the second statement was higher at 
comprehensive colleges and universities (defined as in- 
stitutions in which more than half the baccalaureates 
arc given in occupational or professional disciplines) 
than at research universities, doctoral universities, or 
'liberal arts colleges, but there was essentially no varia- 
tion in agreement levels for the first statement. This sug- 
gests that faculty in comprehensive colleges and univer- 
sities may feel pressured to give good grades to students 
who arc performing at only marginal levels of achieve- 
ment, possibly because occupational and professional 
course grades have a more direct impact on career out- 
comes than is the case for liberal arts course grades. 

Students also respond differently to strict or lenient 
grading standards. Johnson and Beck (1988) found that 
students with low SAT scores did better when they were 
graded on a strict scale and that grading scale variations 
had their greatest influence on students with low SATs. 
Still other researchers, such as Sarafina and DiMattia 
(1978), have argued that grades arc undesirable because 
they reduce students' intrinsic motivation. 

The relationship of changing student achievement 
levels, changing orientation to grades, and reactions to 
grading standards merits further research. 

In sum, the literature on grades and grading prac- 
tices suggests that the observed differences across insti- 
tutions and departments, as well as the rise in GPAs that 
has been reported at many institutions, especially 
during the 1970s, may have been the result of changing 
institutional or departmental policies and/or practices 
or of changing attitudes and behaviors among faculty. 
From the institutional perspective, GPAs may be related 
to admission standards, curricular requirements, grade 
reporting systems, the use of part-time faculty, the use 
of student course evaluations, and administrative pres- 
sures to limit grade inflation or to present the appear- 
ance of a uniform grading system. At the departmental 
level, many of the same factors apply. In addition, de- 
partmental differences in grading may be related to dif- 
fering views of knowledge and approaches to assess- 
ment across disciplines, as well as to variations in class 
size and the level of the courses offered. The grades of 
individual faculty arc probably also affected by their 
differing views of knowledge, the kinds of assessment 
they use, and their views about the purposes of assess- 
ment. Faculty may respond differentially to pressures 
from students to award higher grades. There is some ev- 
idence to indicate that part-time faculty, graduate assis- 
tants, and instructors grade less strictly than full-time 
faculty. 



Although the lirerature on grades and grading prac- 
tices in higher education is extensive, it nonetheless con- 
tains many gaps. For example, we do not know the ex- 
tent to which differences in grading across disciplines 
result from differences in departmental policies, faculty 
attitudes, or differences in the types of students en- 
rolled. Nor do we know if or how faculty grading 
philosophies arc related to grading practices. In addi- 
tion, there arc dangers in trying to reach conclusions 
about current grading practices on the basis of the re- 
search summarized here because some of this research 
may be outdated; there were relatively few empirical 
studies of grading during the 1980s. As a result, we 
know little about current grading practices in higher ed- 
ucation, how these may have changed during the past 
decade, and why. This study is a first step toward ad- 
dressing some of these topics. 

Method 

This study sought information to answer four ques- 
tions: 

• What are the current institutional and/or depart- 
mental policies and practices related to grading? 

• Were there changes in these policies and prac- 
tices between 1980 and 1990? If so, what was 
the nature of these changes? 

• What arc the current grading practices and atti- 
tudes of college faculty who teach undergraduate 
courses? Have these changed over time? 

• Do faculty grading orientation and practices 
differ across departments? 

Instruments 

Based on the preceding review of the literature and on 
suggestions from researchers who have previously 
studied college grading practices, three questionnaires 
were developed: (1) an institutional questionnaire, (2) a 
departmental questionnaire, and (3) a faculty question- 
naire. The rationale for this three-questionnaire ap- 
proach was that grades are likely to reflect institutional 
and departmental policies and expectations as well 
as individual faculty members' attitudes and grading 
approaches. 

The institutional and departmental questionnaires 
were designed to ascertain if there were pressures on 
faculty that might affect their grading practices. The in- 
stitutional questionnaire asked about the nature of the 
grade reporting system, policies related to grading, rc- 
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quired courses and other curriculum matters, and insti- 
tutional use of student evaluations, as well as for infor- 
mation about institutional characteristics. 

The departmental questionnaire focused on depart- 
mental grading policies and/or expected grading prac- 
tices, how these are communicated to faculty members, 
the extent of review of grades by department chairs, and 
the frequency of departmental discussions about 
grading. The rationale was to determine the extent to 
which departments try to develop some consensus 
about grading. 

The faculty questionnaire was the most complex in- 
strument of the three. It began by asking faculty about 
their institution and department, including their percep- 
tion of student ability, in order to determine if there was 
a relationship between grading practices and perceived 
student characteristics. Faculty were then asked about 
their perceptions of the pervasiveness of departmental 
policies about grading to determine how much these 
might affect grading practices, and their perceptions of 
departmental efforts to raise grading standards, which 
might help to inform changing relationships between 
test scores and grades. The next section of the faculty 
questionnaire dealt with attitudes about grading, 
changes in grading philosophy, and opinions about the 
purposes of grading. This section also included ques- 
tions about faculty perceptions of variations in the 
meaning of course grades, both across institutions and 
across disciplines, and faculty perceptions as to why un- 
dergraduate grades now tend to be higher than they 
were 20 years ago. 

Questions about attitudes toward grading were in- 
cluded so the findings of this study could be related to 
DressePs (1976) and Etzioni's (1975) theories about the 
relationship between grading philosophies and grading 
practices. Questions about the purposes of grading were 
included to permit exploration of Hambleton and Mur- 
ray's (1977) finding that faculty in different depart- 
ments hold differing views on this topic. A question 
about changes in grading philosophy was included to 
provide some information about the extent to which 
grade inflation may be related to changing faculty views 
rather than to institutional and/or departmental factors. 
A question about the reasons for grade inflation was in- 
cluded to determine if faculty felt the changes were re- 
lated to changes in the institution, in the students, or 
among faculty. 

The next section of the faculty questionnaire dealt 
with grading practices, including grading orientation 
(criterion-referenced, norm-referenced, or self-refer- 
enced, as described by Geisinger and Rabinowitz 1979). 
To obtain a better understanding of the factors that fac- 
ulty consider in assigning grades and of how informa- 
tion about student learning is obtained, this section also 



asked questions about the importance of various factors 
in grading lower-division, upper-division, and field ex- 
perience/internship courses, as wel! as the types of as- 
sessment used. As noted previously, other researchers 
have found that the factors included in a grade and the 
kinds of assessment vary, both by department and by 
course characteristics. A brief section on student evalu- 
ations of faculty was included to determine the extent to 
which faculty feel pressured by this practice. Finally, a 
series of background information questions (academic 
rank, years in teaching, tenure status, degree, age, sex, 
and race/ethnicity) were included. 

Institutional Sample 

The original sampling plan was to select 12 institutions, 
dividing the sample evenly between public and private 
and between less selective and more selective institu- 
tions. The rationale for the sampling plan was based on 
findings that SAT validity has tended to decline more in 
public colleges than in private colleges and more in in- 
stitutions where the average entering student's SAT 
score (verbal and math combined) was below 950 than 
in institutions where the average entering student's SAT 
score was above 950 (Morgan 1989). One hypothesis 
about this decline in validity is that it might be related 
to changes in college grading practices. For this reason, 
the sample included some institutions that had partici- 
pated in a previous study of the predictive validity of the 
SAT (Ramist, Lewis, and McCamley 1990). We also 
wished to include in this study institutions that had 
been the subjects of previous research on grading, so 
previous findings might help inform our own results. 
The institutions were not intended to be a representative 
sample and, therefore, no generalizations to other col- 
leges and universities should be made. 

The final sample consisted of 14 institutions. Of 
these, eight (57 percent) are public and six (43 percent) 
are private. Using the Carnegie classifications, three are 
research institutions, three are doctoral institutions, five 
are comprehensive institutions, two are liberal arts in- 
stitutions, and one is a specialized institution. Three of 
the 14 institutions reported that the highest degree they 
award is the bachelor's; 5 award the master's; and 6 
award a doctorate. Only 3 of the colleges and universi- 
ties in this study had participated in the previous va- 
lidity study, and they are all private institutions that 
would be rated "more selective" in admission. 

The institutions ranged in size from approximately 
2,000 to over 22,000 full-time undergraduate enroll- 
ments. Part-time enrollments ranged from under 100 to 
approximately 6,000. Average total undergraduate en- 
rollment (both full time and part time) was 18,322 at 
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the research institutions, 7,691 at the doctoral institu- 
tions, 4,769 at the comprehensive institutions, and 
3,108 at the libera! arts institutions, Part-time students 
represented approximately 15 percent of the tofal un- 
dergraduate enrollment at the research institution*, 38 
percent at the doctoral institutions, 34 percent at the 
comprehensive institutions, and 1 percent at the liberal 
arts institutions. 

The 14 institutions vary considerably in selectivity. 
According to the College Handbook, which provides 
data on average SAT scores for the middle 50 percent of 
entering students, the combined SAT (verbal plus math- 
ematical) score at three of the institutions was below 
950; these institutions are designated "less selective'' in 
this analysis. At six, the average combined SAT score 
for the middle 50 percent of entering students ranged 
between 950 and 1100; these are designated "moder- 
ately selective." At four of the institutions, the average 
combined SAT score was above 1100 for the middle 50 
percent of entering students; these are designated "more 
selective." No data on selectivity were available for the 
specialized institution, so it was excluded from all 
analyses involving selectivity; the best estimate is that it 
is in the more selective category. (It described itself as 
very selective when responding to the institutional ques- 
tionnaire.) When asked to rate the selectivity of their 
current (1990) admission policies, one institution de- 
scribed itself as having open admission, seven said they 
were moderately selective, and six said they were very 
selective. When asked about their admission policies in 
1980, three said they had open admission at that time, 
six said they were moderately selective, and five said 
they were very selective. Comparisons of institutional 
self-reports of current selectivity and selectivity as re- 
ported in the College Handbook are shown in Table 1; 
agreement was high. 

The public institutions in this sample were more 
likely to award the doctorate than the private but they 
also appeared to have somewhat less selective admission 
policies. Public institutions were much more likely than 
private ones to have a Carnegie classification of com- 
prehensive, indicating that at least half their bachelors 
degrees were awarded in applied areas. There were no 
public liberal arts colleges in this sample. 

Department and Faculty Sample 

In order to obtain information about grading practices 
in a variety of academic areas, seven departments and 
the faculty members in them were selected for study, 
given that the institutions had such departments. In se- 
lecting departments, the rationale was to include those 
in which freshman students arc likely to enroll; to in- 
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Institutional Selectivity in 1990: 
Sclf-Rcportcd Versus College Handbook 


College Handbook Estimate 


Self-Reported 




Open 


Moderately 
Selective 


Very 
Selective 


Less selective 1 


2 


0 


Moderately selective 0 


6 


0 


More selective 0 


0 


4* 


* Does nor include the specialized institution estimated to be more* selective 
but for which no College Handbook data was available. 



elude the sciences, social sciences, and humanities; and 
to include departments with an occupational orienta- 
tion as well as departments in the liberal arts. Efforts 
were made to select departments that were reported to 
vary in their grading standards or orientation. The de- 
partments selected were: business, chemistry, education, 
English, history, mathematics, and psychology. All fac- 
ulty members in each of the selected departments were 
included in the faculty sample. 

Data Collection 

Each institution that agreed to participate in this study 
was sent a copy of the institutional questionnaire. It had 
the option of completing and returning the question- 
naire or of answering the questions during a telephone 
interview; most chose the latter but also provided sup- 
plementary written materials. 

The institutions were asked if they had any of seven 
targeted departments and, if so, to provide the name of 
the department chairperson and a roster of the faculty. 
All 14 colleges and universities had undergraduate de- 
partments in chemistry, English, history, and mathe- 
matics. All but one had a psychology department. Seven 
had an undergraduate business department and nine 
had an undergraduate education department. 

Each of the 85 department chairpersons was sent a 
letter explaining the study, a copy of the departmental 
questionnaire, and a copy of the faculty questionnaire. 
The department chairs were asked to complete both a 
departmental and a faculty questionnaire and return 
them to Educational Testing Service (ETS). A month 
later, each received a follow-up postcard asking them to 
return their questionnaires if they had not already done 
so and to encourage faculty in their department to re- 
turn the questionnaires. Completed questionnaires 
were returned by 68 percent of the department chairs 
(58 individuals). 

Faculty questionnaires were sent to all individuals 
on the rosters of smaller departments and to all cur- 
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rently active full-time faculty (to the extent that part- 
time, adjunct, and emeritus faculty could be deter- 
mined) in the larger departments. Therefore these 
findings may underrepresent differences between full- 
time and part-time faculty. In most cases arrangements 
were made for the faculty questionnaires to be returned 
directly to ETS but, in a few cases, departmental coop- 
eration was contingent on the questionnaires being re- 
turned to the department chairs and then forwarded to 
ETS. Anonymity of the respondents was important and 
it was not possible to do individual follow-ups with fac- 
ulty who did not return their questionnaires. The total 
number of faculty questionnaires distributed was 2,1 80. 
Completed usable questionnaires were returned by a 
disappointing 25 percent of the faculty (542 indi- 
viduals). Return rates varied considerably across insti- 
tutions. 

Results 

Institutional Analysis 

Curriculum 

As discussed previously, curriculum changes occurred at 
many colleges and universities in the period between 
1980 and 1990. The 14 responding institutions were no 
exception. Overall, the picture showed a move toward 
greater specification of curriculum. While none of these 
institutions had put a core curriculum in place during 
the decade, seven had instituted or added general edu- 
cation courses that all students were required to take 
and one had instituted such courses for specified stu- 
dents. The newly required courses included history, 
writing, non-Western culture, and foreign language. 
Four of the institutions adding new course requirements 
are privately controlled and more selective; three are 
public and rate themselves as moderately or very selec- 
tive. An additional four of the responding institutions 
had added or instituted distribution requirements 
whereby students had to choose courses in specified 
fields or areas. None of the institutions reported having 
eliminated either a core curriculum or required courses, 
but one reported eliminating distribution requirements. 

Three of the 14 institutions said they had made no 
change between 1 980 and 1 990 in the number and type 
of courses required of undergraduates, but one of these 
reported that plans for changes were in progress. The 
two institutions that had made no curriculum changes 
and had no immediate plans to do so are both public in- 
stitutions that award doctoral degrees; one is less selec- 
tive and one is moderately selective. Both these findings 



and those from other recent surveys (Lewis and Farris 
1989; Toombs and Fairweather 1989) suggest that 
changes in general education requirements between 
1980 and 1990, such as the addition of required courses 
for all students, may have affected college grades, espe- 
cially in the freshman year. Narrowing students' course 
choices makes it more difficult for them to avoid 
courses in which they believe they will not do well. 
Also, requiring a course is often associated with larger 
class size, which may also have affected grades. Both of 
these factors would be expected to result in lower 
grades. 

Grading Systems 

The colleges and universities responding to the survey 
were asked to indicate all grading systems currently 
used and, also, all systems used in 1980. The results, 
shown in Table 2, suggest that these institutions moved 
toward a more differentiated grading system than they 
had in 1980. One institution had used descriptive eval- 
uations in 1980 but had abandoned this system by 
1990. 

Twelve of the 14 colleges and universities used 
pass/fail and/or credit/no credit grading in 1990. 
Among these there v/as a tendency to place some limi- 
tations on this type of grading. Two-thirds limited the 
number of courses students could take under pass/fail or 
credit/no credit grading. Most (83 percent) did not 
allow students to receive pass/fail or credit/no credit 
grades in their major. And in most of these institutions 
relatively few students (estimates generally ranged from 
1 percent to less than 5 percent) received credit/no 
credit or pass/fail grades; however, one university re- 
ported that approximately 20 percent of its students re- 
ceived this type of grade. 

Each institution was asked how it defined an "av- 
erage" grade in an undergraduate course. Seven of the 
institutions indicated that an average undergraduate 
grade was defined as C, equivalent to 2.0. Five replied 
that there was no official definition; of these one indi- 
cated that C was accepted as average performance while 
another said that empirically the average was B-. One 



table 2 



Number of Institutions Using Various Gracing Systems in 


1990 and in 1980 




1990 1980 


Five-letter system (A,B,C,D,F) 


7 9 


Letter system with plus and minus 


8 4 


Pass/fail 


10 11 


Credit/no credit 


7 7 


Descriptive evaluations 


0 1 
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institution said the average undergraduate grade, de- 
fined on a bell-shaped curve, was 2.62 or C+. One said 
that it defined the average undergraduate grade as 2.7. 

While a majority of the 1 4 institutions said they had 
no unusual grading policies or practices, five reported 
policies that they believed to be unusual. These in- 
cluded: (1) not giving grades of A+; (2) differing grading 
policies among the university's undergraduate schools 
and colleges; (3) rules regarding recording of oass/fail 
grades, including whether or not the transcript indicates 
there is a nonrecorded grade; (4) changing from having 
no final exams and no grades, except narrative evalua- 
tions, in 1980, to using traditional letter grades almost 
exclusively in 1990; and (5) waging a campaign from 
1982 to 1990 to "stem grade inflation," which was re- 
ported to have reduced the overall undergraduate GPA 
from 3.0 to 2.7. 

Institutional Grading Policies 

The responding institutions were asked about their cur- 
rent (at the time of questionnaire completion in 1990) 
and 1980 policies in five areas related to grading: (1) in- 
completes and withdrawals; (2) recording grades; (3) 
computing the GPA; (4) other grading policies; and (5) 
GPA requirements. These are summarized in Table 3. 

As Table 3 indicates, grading policies at these insti- 
tutions remained fairly stable between 1980 and 1990. 
The overall picture is one of fairly tight institutional 
control over time limits for withdrawals from courses. 
All 14 institutions placed some restrictions on with- 
drawal from a course. In addition, passing grades were 
required for course withdrawal in nearly half the insti- 
tutions. Only one allowed students to withdraw from a 
course at any time and without penalty. With regard to 
making up incompletes, all the institutions had a time 
limit in 1990 and all but one had a limit in 1980. 

Institutional control over grades was also evident; 
at most institutions students had to meet minimum GPA 
requirements to remain in good standing and to grad- 
uate. In addition, at over half the institutions students 
were placed on probation when their GPA fell below 
the stipulated minimum. At 6 of the 14 institutions, the 
policy of using both grades from repeated courses when 
computing the GPA established some control over grade 
inflation. But policies pertaining to how the GPA is 
computed that might increase GPA inflation were noted 
as well. These included using only the last grade from 
repeated courses and using grades from remedial 
courses when computing the GPA. 

Student Evaluation of Faculty 

Of the 12 institutions responding to the question about 
student evaluation of instructors, 1 1 reported having 

n 



TABLE 3 



Summary of Institutional Grading Policies 
add in 1980 


in 1990 






1990 


1980 


A. Incompletes and withdrawals 


Time limit for making up incompletes 


14 


13 


Withdrawal at any time without penalty 1 i 


Withdrawal only in certain time period 


12 


12 


Passing required for withdrawal 


0 


0 


Withdrawal only in certain tune period 
and if passing 


6 


6 


B. Recording grades on transcript 


Second gr<- ic replaces first when 
repeating a course 


3 


3 


Failing grades not recorded 1 1 


D*s not recorded 


0 


0 


C. Computing the GPA 


Use only grades from courses that 
count toward degree 


) 


2 


Use both grades from repeated courses 


6 


6 


Use last grade from repeated courses 5 5 


Include grades in remedial courses 


4 


4 


D, Grading policies (not usual practices) 


Course average must be between B and C 


0 


{) 


Must grade against specific standards 


3 


3 


Must grade relative to overall class 


0 


0 


Must grade relative to siuden: ^ ability 1 1 


Must grade on progress toward 
individual objectives 


0 


I) 


Must include effort in grade 


0 


0 


Must include attendance in grade 1 1 


Must grade on curve 


0 


0 


E. GPA requirements 


Minimum GPA required for good standing 


12 


12 


Probation when GPA falls below minimum 


12 


12 


Probation when any semester below minimum 


6 


5 


Students on probation suspended 

when semester GPA is below minimum 7 7 


Students remain on probation until 
overall GPA rises above minimum 


S 




Minimum GPA required for graduation 


12 


12 


Minimum GPA in ma»or required 
for graduation 


S 


8 



such evaluations. Among these institutions, 1 1 used stu- 
dent evaluations as part of faculty retention, promotion, 
and tenure decisions in 1990 and 8 used student evalu- 
ations in such decisions in 1980; the increase occurred 
in public, moderately selective institutions. One institu- 
tion allowed faculty to determine whether or not stu- 
dent evaluations would be used in making promotion 
and tenure decisions. Five institutions said they used 
student evaluations as part of faculty merit pay deci- 
sions in 1990, while only two made this use of student 
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TABLE 4 



TABLE 5 



Institutional Uses of Student Evaluations of Faculty in 1990 
and in 1980 (N-U) 





1990 


1980 


Faculty retention decisions 


11 


8 


Faculty promotion decisions 


1 1 


8 


Tenure decisions 


11 


8 


Merit pay decisions 


5 


2 



evaluations in 1980. This increase occurred in less and 
moderately selective public institutions. Four reported 
using student evaluations to make decisions about 
teaching awards. Three institutions said they used stu- 
dent evaluations to help faculty modify a course or im- 
prove teaching methods. These data are summarized in 
Table 4. 

Department Chairperson Analysis 

Questionnaires were received from 59 department 
chairpersons; 3 from business, 11 from chemistry, 5 
from education, 8 from English, 12 from history, 7 
from mathematics, 9 from psychology, and 4 for whom 
departmental identification was not available. Slightly 
more than half (55 percent) of the department chair re- 
sponses came from public institutions and 45 percent 
from private institutions, closely reflecting the represen- 
tation in the institutional sample. About a third of the 
responses were from comprehensive institutions, about 
22 percent from research institutions, another 22 per- 
cent from doctoral institutions, and about 20 percent 
from liberal arts institutions. About 20 percent of the 
department chair responses came from less selective, 
about 43 percent from moderately selective, and about 
37 percent from more selective institutions. 

None of the department chairs reported having spe- 
cific departmental grading policies and only one depart- 
ment, the history department at one of the more selec- 
tive private institutions, reported having a specific 
expected grading practice. (Expected practices are no- 
formal ly stated, but are transmitted to department 
members through informal means.) Thus it appears that 
departmental grading policies and explicitly stated de- 
partmental grading expectations played a small role in 
the grading process. 

Sixty-two percent of the department chairs re- 
sponding to the questionnaire said they routinely re- 
viewed the grades faculty gave in courses in their de- 
partments. Variations by department and by- 
institutional characteristics are shown in Table 5. It is 
not known whether or not faculty grading practices 



Percentage of Department Chairs Reviewing Faculty Grades 



A. 


By department 






Education 


ino 




Mathematics 


86 




English 


75 




Business 


67 




History 


64 




Chemistry 


55 




Psychology 


33 


B, By institutional selectivity 




Less 


100 




Moderate 


52 




More 


29 


c 


By Carnegie category 






Comprehensive 


76 




Doctoral 


64 




Liberal arts 


44 




Research 


18 


n. 


By institutional control 






Public 


64 


Private 45 



were affected by knowledge that the grades would be 
reviewed by the department chair. 

Although many department chairpersons evidenced 
a concern about grading practices through their routine 
reviews of the grades given, only about a third said they 
ever had formal meetings of department faculty to dis- 
cuss grades. Of these, about 80 percent had one or two 
and 20 percent had three to five such discussions per 
year. The frequency of such discussions, by department 
,and By selected institutional characteristics, is shown in 
Tabic 6. It seems likely that faculty in departments that 
have formal discussions of grading will have a more 
uniform view of grading standards and practices than 
faculty in departments that do not have such discus- 
sions. 

Informal discussions of grades were irore common 
and were reported by approximately 75 percent of the 
department chairs. Half of the department chairpersons 
reported having informal discussions about grading 
with faculty one or two times a year, while another 32 
percent reported having such discussions three to five 
times a year and the remaining 18 percent reported in- 
formal discussions about grading from once a month to 
once a week. The frequency of these discussions, by de- 
partment and by institutional characteristics, is shown 
in Table 6. 

The evidence from this portion of the analysis indi- 
cates that departmental influences on grading work pri- 
marily through informal and formal discussions about 
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TABLE 6 



Percentage of Department Chairs Reporting Formal and 
Informal Discussions of Grading, by Departmental and 
Institutional Characteristics 


Formal Informal 


A, By department 


Education 


60 


100 


Chemistry 


45 


91 


Psychology 


44 


89 


History 


33 


67 


Mathematics 


14 


71 


English 


12 


63 


Business 


0 


66 


B. By institutional selectivity 


Less 


40 


100 


Moderate 


38 


67 


More 


l^ 


45 


C. By Carnegie category 


Comprehensive 


47 


88 


Research 


36 


45 


Liberal Arts 


20 


80 


Doctoral 


9 


73 


D. By institutional control 


Public 


39 


"8 


Private 


22 


"0 



grading rather than through explicitly stated depart- 
mental grading policies or prescribed practices. The re- 
sponses suggest that departmental influences on grading 
are most likely to be found in less selective colleges and 
universities and in education, chemistry, and psy- 
chology departments. The rather limited amount of dis- 
cussion about grading reported in English departments 
implies that grading practices may vary more in this 
field than in others. Discussions about grading were re- 
ported most frequently by department chairs in less se- 
lective institutions and in comprehensive institutions. 
This suggests that department chairs in institutions that 
enroll students more diverse in ability and goals may see 
a greater need to help faculty develop a consensus on 
grading than do department chairs in institutions in 
which students are more similar. 

Individual faculty Member 
Analysis 

This section reports the results from questionnaires 
completed by 597 faculty members. Department chairs 
with teaching responsibilities also completed a faculty 
questionnaire. 



TABLE 7 



Number of Faculty Responding by Field and 
Institutional Type (N = 597) 




Public 


Private 


Unknown 


Total 


Percent of 
Sample 


Business 


40 


16 


13 


69 


10.1 


Chemistry 


73 


30 




110 


18.4 


Education 


38 


13 


1 


52 


8.7 


English 


76 


36 


2 


114 


19.1 


History 


35 


39 


4 


7 8 


13.1 


Mathematics 


72 


14 


S 


94 


15.7 


Psychology 


45 


32 


2 


79 


13.2 


Field unknown 


10 


0 


0 


10 


1.7 



Table 7 shows the number and percent of faculty 
responding, by field and by institutional control. Fac- 
ulty from English and chemistry departments are 
most frequently represented in the sample, while 
education department faculty are less frequently repre- 
sented. 

Table 8 summarizes faculty background informa- 
tion. Nearly half the responding faculty said they held 
the rank of full professor and about three-quarters 
had received tenure. Nearly all the respondents taught 
full time and nearly all heid a doctoral degree. Slightly 
more than half the respondents had been teaching at 
the college level for 16 years or longer. About three- 
quarters were male and nearly all were white. More 
than half were under 50. Among those indicating their 
age, about 26 percent were 39 or younger, 34 percent 
40 to 49, 27 percent 50 to 59, and 13 percent 60 or 
older. 

Departmental Information 

Because the characteristics of the departments in which 
they teach may be associated with faculty members' 
grading ; . actices, information about the highest degree 
awarded, number of majors, teaching loads, and faculty 
perceptions of student quality was collected. 

About 38 percent of the respondents taught in de- 
partments in which the highest degree awarded is the 
bachelor's, 24 percent in departments in which the 
highest degree is the master's, and 38 percent in depart- 
ments that award the doctorate. Slightly more than half 
the respondents from psychology and history depart- 
ments indicated their departments awarded the doc- 
torate as the highest degree, but only 16 percent of the 
business faculty were from departments that award the 
doctorate. Slightly more than half the respondents from 
chemistry and mathematics departments said the 
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TABLE 8 



Percentage of Responding Faculty with Various 
Characteristics (N = 597) 


Background 


Percent 


A. Rank 


} : ull professor 


45 


Associate professor 


28 


Assistant professor 


19 


Instructor 


1 


B. Tenure status 


Tenured 


75 


Not tenured 


25 


C. Employment status 


Full time 


97 


Part time 


3 


D. Teaching experience 


16 or more years 


55 


8 to 1.5 years 


26 


A to 7 years 


11 


0 to 3 years 


8 


K. Sex 


Male 


"6 


Female 


24 


F. Race/ethnicity 


White 95 


Other 


5 



highest degree awarded was the bachelor's, but only 19 
percent of the responding psychology faculty indicated 
this was the highest degree in their department. 

The literature on grading suggests that the level of 
the courses that faculty teach might be associated with 
grading practices. The faculty members in this study, on 
average, devoted about 47 percent of their caching time 
to introductory/lower-division courses, 41 percent to 
advanced/uppcr-division courses, and 22 percent to 
graduate courses. (The percentages do not total 100 be- 
cause some faculty taught no graduate courses.) Faculty 
in mathematics and in chemistry reported spending the 
highest proportion of their time teaching introductory 
courses (both 57 percent). All other faculty averaged 
less than 50 percent of their time teaching introductory 
courses, with education and psychology faculty re- 
porting the least time teaching at this level. 

The departments varied considerably in the number 
of undergraduate majors, ranging from less than 10 to 
more than 100. While nearly half the business depart- 
ments reported having more than 100 majors, none of 
the chemistry departments and only 3 percent of the 
mathematics departments reported this many majors. 
Business, education, and psychology departments aver- 



aged between 76 and 100 undergraduate majors while 
English and history departments averaged between 51 
and 75. Mathematics departments averaged between 26 
and 50 majors while chemistry departments averaged 
1 1 to 25. 

The literature on grading suggests that both the 
number of classes taught and the size of these classes 
might be related to grading practices. Faculty members 
responding to the questionnaire reported teaching, on 
average, 2.2 undergraduate courses each term. Chem- 
istry faculty reported teaching the fewest courses per 
term, with an average of 1.66, while English and busi- 
ness faculty reported teaching the most courses, 2.49 
and 2.46 respectively. Responding faculty members in- 
dicated that their introductory/lower-division classes 
averaged about 55 students and their advanced/uppcr- 
division classes about 19 students. Introductory classes 
tended to be much smaller in English (averaging 34 stu- 
dents), and in mathematics, education, and business (all 
averaging 38 students) than in chemistry (89 students), 
history (71 students), and psychology (66 students). 
There was less variation reported in the size of ad- 
vanced/upper-division classes, with the smallest classes 
in mathematics (18 students) and the largest in psy- 
chology (36 students). The average advanced/ upper- 
division class size reported was 26 students. 

The literature on grading also suggests that faculty 
may adjust their grades to the ability level of their stu- 
dents. In this study, most faculty (69 percent) felt that 
the students in their department were as well prepared 
as students in other departments at their institution; 29 
percent felt their students were better prepared, and 
2 percent felt their students were less well prepared. 
Faculty in chemistry, education, and business depart- 
ments were most likely to feel that their students were 
better prepared than the average student in their insti- 
tution. 

Efforts to Raise Departmental Standards 

Some believe that grading practices have changed in re- 
cent years in response to departmental efforts to raise 
standards. In this study, about half (56 percent) of the 
respondents said their departments had tried to raise 
standards during the previous 10 years, 30 percent said 
their department had not tried to raise standards, and 
14 percent did not know whether or not their depart- 
ment had tried to raise standards during this period (see 
Table 9). Only about a quarter of chemistry department 
faculty (26 percent) said their departments had tried to 
raise standards, while about 85 percent of education de- 
partment faculty reported such efforts. Approximately 
20 percent of chemistry and mathematics faculty said 
they did not know if there had been departmental cf- 
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TABLE 9 



Percentage of Responding Faculty Reporting Dep 
Efforts to Raise Standards, by Field {N - 558) 


art men t 




Yes 


No 


Don't Know 


Business 


73 


14 


13 


Chemistry 


26 


55 


19 


Education 


85 


4 


1 1 


English 


60 


17 


13 


Historv 


53 


41 


6 


Mathematics 


52 


28 


20 


Psychology 


66 


23 


10 


Total 


56 


30 


14 



forts to raise standards, while only 6 percent of history 
faculty indicated they did not know if there had been 
such efforts. 

Departmental Grading Policies and Expected 
Practices 

As indicated earlier, wc believed that grading might be 
influenced by institutional and/or departmental policies 
as well as by individual faculty members' grading 
philosophies and practices. In order to determine the 
kinds of grading behaviors that faculty believed the de- 
partment required or expected, faculty were asked if 
certain grading practices were required by departmental 
policy or were expected departmental practice. The re- 
sults are summarized in Table 10. 

Few faculty reported that there were departmental 
grading policies, supporting the evidence from the de- 
partmental questionnaires. The most frequent policy, 
grading students against specific standards, was re- 
ported by about a quarter of all faculty. Other grading 
policies most frequently mentioned by responding fac- 
ulty were giving students an A or a B in an honors 
course, including attendance in the course grade, and 
grading students in relation to the overall performance 
of the class. 

Departmental expectations about grading were 
more commonly reported than were explicit grading 
policies. Again, grading against specific standards was 
mentioned most frequently (by nearly two-thirds of the 
faculty). Grading students relative to overall class per- 
formance was described as an expected practice by 
about half the respondents as was giving a lower grade 
on work turned in late. Slightly less than a third of the 
respondents said their departments expected them to 
give an A or B to students in an honors course, to in- 
clude attendance in the course grade, to include effort in 
the course grade, or ro grade students on how well they 
progressed toward individual objectives, 



TABLE 10 



Percentage of Responding Faculty Reporting Various 
Departmental Policies and Expected Practices Related to 
Grading 




Official 
Policy 


Expected 
Practice 


Students are graded against specific standards 


26 


63 


Students are graded relative to overall 
performance of class 


10 


51 


Students are graded on how well they progress 
toward individual objective* 


9 


30 


.Students* effort to learn is included in rhcir 
course grade 


3 


30 


Students* attitude and/or behavior is included 
in their course grade 


6 


26 


Student* at.,'ndancc is included in their 
course grade 


1 1 


31 


Students are graded "on the curve'* 




n -*■ 


The average course grade must he between a 
B iSO) and a C (70) 


6 


2> 


Students should receive an A or a B in an 
honors course 


14 


32 


Students should receive an A or a B in a 
ficldwork/internship course 


3 


21 


Students can do extra credit projects to raist 
a grade 


4 


15 


When students turn in work late, it nvjst be 
graded lower 




4" 


When students miss several classes, they receive 
a lower grade 


5 


-> 1 



Faculty Attitudes about Grading 

Faculty were asked about their grading altitudes and 
philosophy, whether their grading philosophy had 
changed since they began teaching, the importance they 
attached to various purposes of grading, their beliefs 
about the variations in the meaning of grades, and the 
factors they believed most influenced the higher grades 
received by current students. 

To assess grading attitudes and philosophy, faculty 
members were asked to indicate the extent to which 
they agreed or disagreed with seven statements (shown 
in Table 1 1). As Table 11 indicates, responding faculty 
were most likely to agree with the statement "I view ed- 
ucation as a continuing process of improvement" (At = 
3.53); there was also a high level of agreement with the 
statement "I emphasize formal, objective measurement 
of knowledge" (Af = 3.01 ). Faculty least agreed with the 
statement "I consider grading as a harmful exercise 
which interferes with actual learning" (M = 1.83); the 
statement "Grading standards must recognize differing 
student backgrounds and interests" (Al = 2.00) also re- 
ceived relatively low endorsement. 

Variations in grading philosophies by department 
arc also shown in Table 1 I . In this and most other ta- 
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Faculty Attitudes about Grading, Means by Fie 
Differences Across Fields 
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hies where differences by department are presented, 
analysis of variance was used to determine if there were 
significant differences among the groups. There were no 
significant differences across fields in views about the le- 
niency of grading (/• =1.1 S7.S, df - 559, /; = .3 1 12) or 
about education as a continuing process of improve- 
ment (/■' = .8513, df - 571, p = .53(F). F.xamination of 
the other five statements reveals that the departments 
clustered into two major groups, one seeing grades as 
formal and objective, and the other believing that 
grades cannot be reduced to a set of objective measures. 
Rather predictably, the "objective" camp includes 
chemistry, mathematics, and psychology departments 
while the "nonobjective" camp includes Faiglish, educa- 
tion, and history departments. Faculty in business ap- 
peared to be the most diverse in their attitude they 
could not easily be classified into either of these two 
groups. 

There were significant differences across fields in 
faculty endorsement of the statement "I emphasize 
formal, objective measurement of knowledge" {/■ - 
26.45X2. df- 5~6,/> = .()()()()), with F.nglish and history 
faculty Jittering significantly from business, chemistry, 
mathematics, and psychology faculty. Fducarion faculty 
differed significantly from chemistry, mathematics, and 
psychology faculty on this item. There were also signif- 
icant differences in endorsement of the statement 
"Grading standards must nCogni/c differing student 
backgrounds and interests" (/■' = 9.K04I, df - 581, /; - 
.()()()()), with education faculty differing significantly 
from chemistry and mathematics faculty, and F.nglish 



faculty differing significant!) from chemistry, mathe- 
matics, and psychology faculty. Faculty aiso differed 
significantly in their endorsement of the statement "1 am 
critical of nonobjective grading practices" 13.2648, 
df = 56 L >, /; = .0000), with chemistry faculty differing 
significantly from business, education, Fnglish, history, 
and psychology faculty, and mathematics faculty dif- 
fering significantly from business, education, Fnglish, 
and history faculty. F.ndorscmcnt of the statement 
"(irades cannot Ix 1 reduced to a set of formal, objective 
measures" (/-" = 17.3K49, df = 567, /; = .0000) showed 
chemistry, mathematics, and psychology faculty dif- 
fering significantly from education, F.nglish, and history 
faculty, and business faculty differing significantly from 
Fnglish and history faculty. Indorsement of the state- 
ment "1 consider grading as a harmful exercise which 
interferes with actual learning" (f = 7.6?S4, J/'= 576, 
f> = .0000) showed education and F.nglish faculty dif- 
fering significantly from chemistry and history faculty. 

Forty-three percent of responding faculty said they 
had changed their grading philosophy since they began 
teaching. As shown in Table 12, this percentage varied 
across departments, ranging from a high of 5S percent 
among education department faculty to a low of 33 per- 
cent among chemistry faculty. 

Faculty were next asked to indicate the importance 
they assigned to each of 1 0 possible purposes of 
grading. The results are shown in Table 13. Faculty as- 
signed highest importance to providing students with 
feedback (Al = 3.64), providing other education institu- 
tions such as graduate or professional schools with in- 



23 



17 



I AIM I' 12 



Percentage of Faculty Who Hud Changed 


Their Grading 


Philosophy since They Began Teaching 






4" 


( henustn * ^ 


iikic.Uion 






So 


1 hstory 




Nt.it lu-iiutus 4! 


|\\chi>lo£\ 


■n 


!nt.*.! 


4* 



formation about students (Al = .3. J. I), motivating stu- 
dents to do good work (M = .V L>), and helping the col- 
lege or department make decisions about students (Al = 
3.05). There were no significant differences across fields 
in the importance assigned to grades as a source of in- 
formation for student decisions (/- = .904.S, J/' = 5X2, 
f) = .4^12) or information for the college to make deci- 
sions (F = . 7.164, df = 581, p = .6205). There was a sig- 
nificant overall / ; foi difference^ across fields in the im- 
portance of grades to provide feedback to students 



(/■ = 2.N69"\ df = 585, /> = .0092), information to other 
institutions (/-' = 2.56, df = 584, /; = .0187), and infor- 
mation to employers (/ ; = 2.8239, df = 584, /; = .0102) 
but, for ail three items, the Seheffe procedure indicated 
no statistically significant differences between any pairs 
of fields. 

There were significant differences across fields in 
the importance of grades as a source of instructor feed- 
back about teaching effectiveness [r = 3. 1386, df = 584, 
/; = .0049), with education faculry differing significantly 
from Knglish faculty. There were also significant differ- 
ences across fields in the importance of grades in moti- 
vating students [F = 5.1795, df = 581, p = .0000), with 
education faculty differing significantly from faculty in 
chemistry, history, and mathematics. Faculty also dif- 
fered significantly in the importance of grades as a way 
of preparing students for the competitive nature of adult 
life (/-" = 5.2400, df = 579, p = .0000), with education 
faculty differing significantly from chemistry and his- 
tory faculty; in the importance of grades to help stu- 
dents iearn discipline for work or jobs (/■' = 6.7850, 
df = 583, /; = .0000), with education faculty differ- 
ing significantly from business, chemistry, I'nglish, 
history, and mathematics faculty; and in the impor- 
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ranee of grades ro help maintain academic standards 
{'r - .3.1824, df - 5K2, p = .0044), with education fac- 
ulty differing significantly from faculty in history. 

Education faculty stand out as differing most from 
their colleagues in other fields. They are more likely to 
see grades as providing instructors with information 
about their teaching effectiveness and are less likely ro 
value grades as a way of motivating students, to help 
students prepare for the competition of adult life, or to 
help maintain academic standards. 

Next faculty were asked how much they thought 
the meaning of a course grade varies: (a) across disci- 
plines in their institution and (b) within their discipline 
across institutions. The scale used was 0 = none, 1 = a 
little, 2 = some, and 3 = a lot. Faculty perceived greater 
variation in grading standards across disciplines (M = 
2.60) than within a given discipline (M = 2.15), The 
level of perceived differences in the meaning of course 
grades across disciplines did not vary significantly by 
field (/■ = 1.0080, df - S35, p = .4192). The level of 
perceived differences within disciplines varied sig- 
nificantly (/ ; = 4.1570, df - 524, p = .0004) across 
fields with chemistry differing significantly from psy- 
chology. 

Faculty were asked why they thought grades are 
generally higher today than they were 20 year ago. 
I'rom a list of 13 possible factors, they were asked to se- 
lect three that they believed were important and then to 
indicate which una of these was most important. A 
quick glance at Table 14, which summarizes the find- 
ings, shows that there was considerable diversity of 
opinion. The reason selected most frequently, both as 
the single most important factor (by 30 percent ol re- 
sponding faculty) and as one of three most important 
factors (by 54 percent), was "Faculty expect less of stu- 
dents/ 1 In second place was "Faculty have less concern 
for maintaining high standards," selected as the most 
important reason by 9 percent of responding faculty 
and as one of the three most important factors by 36 
percent. Other factors that responding faculty also fre- 
quently indicated as important included "A change in 
focus toward student improvement rather than absolute 
standards" and "Liberalized course withdrawal poli- 
cies." The latter finding conflicts with the institutional 
data {see Table 4) that did not show any changes in 
course withdrawal policies. 

Grading Practices 

The next section of the questionnaire asked about fac- 
ulty grading practices, including the grading system{s) 
used, how they thought their grades compared with 
those of other faculty, the importance of various ele- 
ments in assigning grades, the types of assessment used, 
and the testing process. 
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Faculty were first asked if they ever used any or all 
of three grading approaches: ( 1 ) grading based on some 
absolute standard (criterion-referenced); (2) grading 
students in relation to the performance of other students 
(norm-referenced); and (.?) grading students m relation 
to their own improvement (self-referenced). 

As Table 15 shows, SI percent of faculty said they 
sometimes used a criterion-referenced approach to 
grading, while 57 percent sometimes used a norm-refer- 
enced approach and 44 percent sometimes used a self- 
referenced approach. There was no difference across 
fields in the use of a criterion-referenced approach (/ : = 
1.4270, df = 569, /; = .2019). There were significant dif- 
ferences in the use of a norm-referenced approach to 
grading (/■' = ?>.M1.\ df = 559, p = .0029), with Scheffe 
tests showing that business faculty used a norm-refer- 
enced approach significantly more frequently than fac- 
ulty in education. There were also significant differences 
across fields in the use of a self-referenced approach to 
grading (/•' = 9.3700, df = 540, /; = .0000), with educa- 
tion, English, and history faculty using this approach 
significantly more frequently than mathematics faculty 
and English faculty using it significantly more fre- 
quently than business and psychology faculty. 

Next faculty were asked which one of these ap- 
proaches they used must often. About 64 percent of re- 



25 



19 



J AM I 15 



Grading Practices of Responding Faculty ;ind 


Vobnhility o 


f /'- Test Ratio for Significant Differences Across 


Fields 








Percentage Yes 






liusiness 


V.hvmistrv 


Education English I listory Mathematics 


Psychology 


Total 


Miidmts .m- jir.ulctl <m ihe \\,\\ ihrir 

.Klmwimm lc\t l «.iinip.ircs wnli sunk- 

absoluu- sj.iikI.hJ - } 


S4 


"> M S- 


H4 


XI 
/>=ns 


Students .ire gr.nlcd in ivl.tiiun (<> tlu 
piTlcirm.inii' ol other students ~o 


61 


^ hi 4X S2 


6 4 


S~ 
/<-.<)<)> 


Students ,ire gr.idt-d in rel.thon t«i their uwn 











/>=.<)()<) 



sponding faculty indicated rhat they most often used a 
criterion-referenced approach. About 29 percent said 
they most often used a norm-referenced approach, and 
about 8 percent said they most often used a self- 
referenced approach. Variations by Held are shown in 
I able 16. There were no statistically significant differ- 
ences across fields in preference for the criterion- 
referenced approach {/ : = 1.2812, df - 519, /; = .2641) 
or the norm-referenced approach (/•' = 1 .8822, df "= 5 1 9, 
/; = .0820). However, there were significant differences 
in preference for the self-referenced approach {/•' = 
3.4304, <// = 519, /; = .0025), with faculty in education, 
Knglish, and history most likely to prefer this approach 
to grading. The Scheffe procedure showed no significant 
differences across disciplines. 

Responding faculty members tended to believe they 
were somewhat stricter graders than were other faculty 
in their department, in other departments in their insti - 
tution, and in their discipline in other institutions. 
When asked to rate their grades in introductory courses 
in comparison with the grades of others, using'the scale 
1 = lower, 2 = about the same, and 3 = higher, the typ- 
ical response was slightly lower than others in their dis- 
cipline in other institutions (M = 1.92) and others in 
their department (A'l = 1.8 1), and moderately lower 
than faculty in their institution in other departments 
(M = 1.59). Variations across fields are shown in 



Table 17. Education faculty were the only group to re- 
port that they graded somewhat higher than faculty in 
other departments of their institutions. Chemistry fac- 
ulty were the only group to report that they graded 
somewhat higher than faculty in the same field in other 
institutions. There were no significant differences across 
fields in regard to how grades were believed to compare 
with other faculty in the same department 1.6769, 
<// = 496, /; .1246) or with faculty in the same disci- 
pline in other institutions (/• = .3761, df - 287, 
f) = .8939). There were significant differences when fac- 
ulty compared their grading with faculty in other 
departments at their own institution (/• = 15.9316, 
(//'= 420, /; = .0000). There were also significant differ- 
ences between chemistry faculty and faculty in business, 
education, Hnglish, and psychology; between mathe- 
matics faculty and faculty in business, education, and 
psychology, and between faculty in education and those 
in Knglish and psychology. 

baculty were asked to indicate the importance they 
gave to various factors in grading introductory/lower- 
division courses and in grading advanced/upper- 
division courses, The results are summarized in Table 
18. Tests and quizzes had the strongest influence on 
grades in introductory courses (M = 3.61) and some- 
what less influence in advanced courses (/VI = 3.^8), 
while papers and written assignments, which had 
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somewhat less importance in introductory courses 
(Af = 3.1) I I, carried the most weight in advanced courses 
(Al = 3.57). Averaged across the seven fields, oral re- 
ports, written assignments, creativity, class participa- 
tion, group projects, and subject-specific skills and tech- 
niques took on increased importance as students moved 
from introductory to advanced level courses, while tests 
and quizzes declined in importance. 

Departmental variations in the importance of var- 
ious factors in grading introductory courses are shown 
in Table 19 along with the probability of the /--test ratio 
for significant differences across fields. Hxcepr for the 
importance of adhering to due dates (/ ; = 2.0619, 
df '= 524,/; = .0561), there were statistically significant 
differences across disciplines for each of these factors. 
However, although departments showed significant 
variations across fields (/•' = 2.733 i, df = 524, p - 
.0000), the Scheffe procedure showed no pairs of de- 
partments that differed significantly with respect to 
these factors. 



Tests and quizzes varied in importance (/- = 
1 1.9438, df = 530, /; = .0000), with faculty in English 
placing significantly less importance on tests when de- 
termining grades in introductory courses than faculty in 
business, chemistry, history, mathematics, and psy- 
chology; faculty in education placed significantly less 
importance on tests than faculty in business, chemistry, 
mathematics, and psychology. There was also signifi- 
cant variation across fields (f = 27.4984, df = 527, /) = 
.0000) in the importance of papers, with faculty in 
mathematics, chemistry, and business giving signifi- 
cantly less importance to papers than faculty in English, 
education, and history. Oral reports also showed signif- 
icant differences across fields (f = 9.4922, df = 503, 
/; = .0000), with education faculty giving them signifi- 
cantly more importance than faculty in business, chem- 
istry, English, history, mathematics, and psychology, 
and faculty in English giving them significantly more 
importance than faculty in chemistry. A similar pattern 
occurred with respect to group projects (/ : - 12.1219, 
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df - 490, /; = .0000), with education faculty giving 
them significantly more importance than faculty in 
business, chemistry, English, history, mathematics, and 
psychology. 

Attendance also varied considerably in importance 
as a component of introductory course grades (/*' = 
22.3079, df = 524, p = .0000), with chemistry faculty 
giving it significantly less importance than business, ed- 
ucation, English, and history faculty. Mathematics 
and psychology faculty placed significantly less impor- 
tance on attendance than did faculty in education, Eng- 
lish, and history. Class participation was significantly 
more important as a component of grades for introduc- 
tory courses in English, education, history, and business 
than in chemistry, psychology, and mathematics 
{overall / ; = 29.8559, df = 526, p = .0000). Psychology 
faculty placed significantly less importance on skills 
and techniques than faculty in the other six fields; 
the /'-ratio across departments for the importance 
of skills and techniques was 12.7782 {df = 500, 
/; = .0000). 



Faculty in English and history placed significantly 
more importance on student improvement when as- 
signing grades in introductory courses than did facul'v 
in mathematics, psychology, and chemistry; student im- 
provement also showed significant variation across dis- 
ciplines (F = 11. 0496, df = 526, p = .0000). Creativity- 
was significantly more important in English and history 
than in psychology, mathematics, chemistry, and busi- 
ness, and significantly more important in education 
than in psychology and mathematics. The impor- 
tance of creativity varied significantly across fields 
(V = 14.4839, df = 510, /; = .0000). the importance 
assigned to students' attitude and effort varied con- 
siderably across departments (/ ; = 10.7004, df =522, 
/; = .0000), with faculty in English, education, and his- 
tory placing more importance on this factor than faculty 
in mathematics and chemistry; faculty in English and 
history also gave significantly more importance to atti- 
tude and effoi than faculty in psychology. 

The ability level of students played an important 
role in introductory course grades, varying significantly 
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across fields {F = ".8582, df = 495, p = .0000), with fac- 
ulty in English considering ability more important than 
those in psychology, chemistry, and mathematics, and 
faculty in history giving it more importance than faculty 
in psychology. Faculty also varied in the extent to which 
they considered student background characteristics (F = 
9.69^6, df = 523, }) = .00(H)), with those in Hnglish 
giving them significantly more importance than those in 
mathematics, chemistry, and business, those in educa- 
tion giving them more importance than those in mathe- 
matics, and those in history giving them more impor- 
tance than those in mathematics and chemistry. There 
was also significant variation in the extent to which fac- 
ulty considered students' personal circumstances when 
assigning grades (/■' = 5.0892, df = 523, p = .0000), with 
faculty in F.nglish significantly more likely to consider 
them when grading than faculty in mathematics and 
chemistry. 

Table 20 summarizes the factors faculty considered 
most important in introductory course grades across 
fields. Chemistry, psychology, and mathematics faculty 
tended to adopt relatively simple models for grading in- 
troductory courses, with tests taking on high impor- 
tance and only one or two other factors, such as subject- 
specific skills, also considered. In contrast, education, 
Hnglish, and history faculty adopted rather complex 
models for grading introductory courses. In both edu- 
cation and Knglish, papers and other written work were 
considered more important than tests and, in addition. 



eight or nine other factor^ were included in grading in 
these subjects. Creativity, improvement, and class par- 
ticipation were moderately important in grades in edu- 
cation, Hnglish, and history. Business tended to fall be- 
tween these two groups in the factors faculty considered 
important in grading introductory courses. 

Departmental variations in the importance of these 
same factors in grading advanced-level courses are 
shown in Table 21 along with the probability of signif- 
icant differences across fields. Papers and other written 
assignments (AI = 3.52) superseded tests and quizzes 
(M = 3.38) as the most important factor in grading ad- 
vanced courses. 

All the factors showed significant variation for ad- 
vanced courses across the seven fields. However, the 
importance of adherence to due dates (/' = 2.5462, 
df = 539, p - .0194) showed no significant differences 
between pairs of fields. Tests and quizzes had signifi- 
cantly more importance for grades in advanced courses 
in chemistn and mathematics than in Hnglish, educa- 
tion, and histor\. Business and psychology faculty also 
placed significantly more importance on tests and 
quizzes than faculty in Hnglish, Variation in the impor- 
tance of tests and quizzes across departments yielded an 
of 13.8415, df= 540, p = .0000. The pattern for the 
importance of papers and written assignments was al- 
most the reverse, with Hnglish, education, history, and 
psychology faculty considering these more important 
than mathematics or chemistry faculty; the over-all / ; 
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Departmental Means for Importance Given to Various Factors When Assigning Grades in Advanced Courses and Probability 
of /"-Test Ratio for Significant Differences Across Fields 
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was 16.0054, c//' = 545, p = .0000. Oral reports (/ : = 
5.4873, df = 527, /; = .0000) rook on significantly 
more importance for grades in education, psychol- 
ogy, and business than in mathematics. Group projects 
(F = 14.3659, df = 502, /; =: .0000) were more impor- 
tant in business and education than in history, mathe- 
matics, chemistry, and English; they were also signifi- 
cantly more important in business than in psychology. 

Attendance (F = 167542, df = 538, /; = .0000) was 
significantly more important as a component of ad- 
vanced course grades in English, education, history, and 
business than in chemistry and mathematics; attendance 
was also significantly more important in English, edu- 
cation, and history than in psychology. Class participa- 
tion (F = 19.771 0,\//' = 541, //= .0000) was significantly 
less important in advanced courses in chemistry and 
mathematics than in business, history, English, and ed- 
ucation. Skills and techniques (/- = 7.5818, df = 522, 
{) - .0000) were a significantly more important compo- 
nent of grades for advanced courses in education, his- 



tory, mathematics, and English than in psychology. Im- 
provement (/ ; = 6.9727, (//*= 539, /; ~ .0000) was more 
important for grades in history and English than in 
mathematics, and more important in English than busi- 
ness. Creativity (F = 4.7615, df = 531, /; = .0000) was 
significantly more important for grades in English and 
history than in chemistry. 

Attitude and effort (F = 8.3919, df = 542, 
p = .0000) was significantly more important in English, 
education, history, and business than in mathematics, 
and more important in English than in chemistry. Fac- 
ulty in English placed significantly more importance on 
students" ability level when grading advanced courses 
than did faculty in education, chemistry, psychology, 
and mathematics; the F-ratio was 6.1851 (df = 515, 
/; = .0000). The importance given to students' back- 
ground characteristics also varied (F = 8.8193, df - 536, 
p ~ .0000), with faculty in English, history, and educa- 
tion considering this factor more important than those 
in mathematics, faculty in English and history consid- 
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cring it more important than those in chemistry, and 
faculty in Fnglish considering it more important than 
those in business. Personal circumstances of students 
also had a varying influence on grades in advanced 
courses (/ : = 5.0382, <//'= 537, p = .0000), with English 
faculty considering this factor significantly more impor- 
tant than faculty in mathematics or chemistry. Finally, 
departmental standards also had a variable influence on 
advanced course grades {/■' = 3.5680, df - 539, p = 
.0000), with chemistry faculty placing significantly 
more important on this factor than faculty in psy- 
chology. 

Table 22 summarizes, across fields, the factors fac- 
ulty considered most important in grading advanced- 
level courses. Chemistry, mathematics, and psychology 
faculty still tended to use less complex models for 
grading, but even these were more complex than for in- 
troductory courses. In chemistry and mathematics, tests 
remained dominant and class participation was still 
unimportant. Psychology faculty emphasized papers 
and ga\c some attention to class participation. Cre- 
ativity, which was important only in education, Fnglish, 
and history at the introductory level, now became im- 
portant in every discipline. Education, Hnglish, and his- 
tory faculty continued to use complex grading models, 
but they were joined by business. In business and his- 
tory, papers replaced tests as the most important factor 
in grading. Oral reports, which were not given much 



importance in grading introductory courses, took on 
some importance in grading advanced courses in busi- 
ness, education, and psychology; business joined educa- 
tion in using the results of group projects to make 
grading decisions. 

Because some fields give considerably more impor- 
tance to laboratory work, internships, or rieldwork than 
do others, faculty were asked if they taught any courses 
in which at least half the final grade was based on field- 
work, internship, or laboratory work. Fourteen percent 
indicated that they taught such courses. There were sig- 
nificant differences across fields (/ ; = 16.3851, (//'= 577, 
/; = .0000). Forty-three percent of education, 26 percent 
of psychology, 19 percent of chemistry, and 14 percent 
of business faculty said they taught such courses. Edu- 
cation faculty taught this type of course significantly 
more frequently than did faculty in any other field; 
chemistry and psychology faculty taught this type of 
course significantly more frequently than did faculty in 
Fnglish, history, or mathematics. 

Faculty who. reported teaching laboratory, field- 
work, or internship courses were asked to indicate the 
importance of each of the grading factors in these 
courses in comparison to their importance in other 
courses. The results are summarized in Table 23, Cau- 
tion is necessary when considering these results since the 
responses are based on approximately T 5 faculty, pri- 
marily faculty in education, psychology, chemistry, and 
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Importance Given to Various Factors in Assigning Grades in Laboratory, Ficldwork, and Internship Courses Versus 
Other Courses 
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business. Two factors had the greatest importance in 
grading laboratory and fie Id work courses — skills and 
techniques and papers and written assignments. Atten- 
dance, attitude and effort, and adherence to due dates 
were also given great importance. Class participation, 
creativity, improvement, and requirements for the pro- 
fession (a grading factor not included for introductory 
or advanced courses) appeared to have moderately high 
importance in such courses. The greatest difference be- 
tween laboratory and ficldwork courses and other 
courses was the lower importance given to tests and 
quizzes. In these courses, faculty gave greater impor- 
tance to attendance, attitude and effort, skills and tech- 
niques, requirements for the profession, class participa- 
tion, and improvement than they did in their other 
courses. There was significant variation across depart- 
ments regarding the requirements of the profession 
when grading laboratory/field work courses (F = .3.0421, 
df = 74, /; = .0154), with education faculty placing sig- 
nificantly more importance on this factor than faculty in 
other fields. 

Assessment in Introductory Courses 

Faculty were asked about the type(s) of assessment they 
used in their introductory courses, who prepared the 
tests, and the source(s) for test content. 

As Table 24 shows, responding faculty were more 
likely to use essay examinations than multiple-choice 
tests. The types of assessment used differed significantly 



across fields. Essay tests (F = 32.9212, df = 445, p = 
.0000) were used significantly more frequently by Eng- 
lish and history faculty than chemistry, mathematics, 
and business faculty; they were used significantly more 
frequently by education and psychology faculty than by 
chemistry and mathematics faculty, and significantly 
more frequently by business faculty than by mathe- 
matics faculty. Multiple-choice tests (/- = .39.8679, df = 
466, p ~ .0000) were used significantly more frequently 
by psychology and business faculty than by English and 
mathematics faculty; psychology faculty used multiple- 
choice tests significantly more frequently than chemistry 
and history faculty. Short-answer tests (F - 2.9032, df = 
454, /; = .0087) were used significantly more frequently 
by chemistry faculty than English faculty. Performance 
assessment (F = 6.6893, df = 372, /; = .0000) was used 
significantly more frequently by education, mathe- 
matics, and business faculty than by history faculty. 

When asked who prepared the tests for introduc- 
tory courses, about 77 percent of faculty stated they 
prepared their tests alone. About 8 percent prepared 
tests for introductory courses with other faculty who 
also taught these courses. About 6 percent of faculty 
who taught introductory courses involved a teaching as- 
sistant in the preparation of examinations. 

About .30 percent of responding faculty said they 
got the items for the tests in their introductory courses 
from a textbook publisher; about 8 percent said they 
got their items from a departmental item bank. There 
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were significant differences across fields in the use of 
these item sources. Items from textbook publishers (F - 
27.6312, df = 562, /; = .0000) were used significantly 
more frequently by psychology faculty than by faculty 
in English, history, mathematics, and chemistry; busi- 
ness faculty were significantly more likely to use pub- 
lishers' items than faculty in English, history, mathe- 
matics, and chemistry, while chemistry and education 
faculty were significantly more likely to use such items 
than faculty in English. Departmental item banks (F = 
5.2571, df = 522, /; = .0000) were used significantly 
more frequently by chemistry faculty than by faculty in 
English or history. 

Student Evaluations and Challenges to 
Grading 

A final group of questions dealt with pressures that may 
make faculty consider modifying or changing their 
grading standards. These pressures may come from the 
use of student course evaluations, either by the depart- 
ment or by the institution, and/or from challenges to the 
grades faculty have given. 

Nearly all (99 percent) of responding faculty indi- 
cated that student evaluations of faculty took place in 
their courses. Faculty reported that these evaluations 
were used for tenure decisions (83 percent), promotion 
decisions (81 percent), retention decisions (76 percent), 
and merit pay decisions (57 percent), as well as to help 
improve instruction (77 percent). While most faculty 
said they seldom or never considered their reputation 
with students and how they would be evaluated by 
them when assigning course grades (34 percent said 
they did not consider these factors, 35 percent said they 
gave them very little consideration), other faculty were 
more concerned. Twenty-six percent of faculty said they 
sometimes gave these factors some consideration, while 
6 percent said they gave them a lot of consideration. 



Sometimes faculty are challenged about or pres- 
sured to change a grade. In this study, about 65 percent 
of responding faculty said students seldom or never 
challenged a grade, while about 30 percent said they 
had such challenges from one or two students in each 
class they taught and about 3 percent said they were 
usually challenged by about 5 percent of the students in 
a class. Only about 12 percent of the faculty said that, 
when challenged, they usually changed the student's 
grade. There was significant variation across depart- 
ments in the extent of student challenges (/-' = 5.41 19, 
df = 568, /; = .0000), with chemistry faculty being chal- 
lenged significantly more frequently than faculty in Eng- 
lish or mathematics. Relatively few (13 percent) of fac- 
ulty said they had ever been asked by a member of the 
faculty or administration to change a student's grade. 

Discussion and 
Conclusions 

This report summarizes institutional policies related to 
grading at 14 colleges and universities and how these 
policies changed between 1980 and 1990. The report 
also summarizes grading policies and practices in the 
business, chemistry, education, English, history, mathe- 
matics, and psychology departments at these institu- 
tions and the grading orientation and practices of fac- 
ulty in these departments. 

Changes Between 1980 and 1990 

Information about grading policies and practices was 
obtained from institutional, department chair, and fac- 
ulty questionnaires. Four areas of change that may be 
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related to eroding were identified: (1) changes in cur- 
riculum requirements; (2) changes in grading systems; 
(.3) increased use of student evaluations to make deci- 
sions about faculty; and (4) changes in faculty grading 
philosophies and attitudes. 

The institutional questionnaires indicated that cur- 
rent curriculum requirements at these institutions 
tended to he more prescribed than they had been a 
decade ago. Seven of the 14 institutions had added gen- 
eral education courses that all students were required to 
take. This rinding, which is similar to those of other sur- 
veys (e.g., Lewis and Harris 1989; Toombs and Fair- 
weather 1989), suggests that changes in general educa- 
tion requirements during the 1980s may have affected 
the grades students received, especially in the first two 
years of college when general education courses are 
most frequently taken. Students may be less motivated 
to achieve in required courses than in elective courses, 
which they may find more interesting. Also, the required 
curriculum limits students' opportunities to take "easy" 
courses. Institutional practices may also affect the 
grades in required general education courses. For ex- 
ample, required courses may have larger class sizes than 
other courses and/or may be taught by less experienced 
faculty. As noted in the review of relevant literature, 
both class size and faculty teaching experience have 
been found ro be inversely related to strict grading prac- 
tices. The reported curriculum changes may be con- 
tributing to lower GPAs at the institutions in this 
survey. 

The institutional questionnaires also indicated that 
grading systems at these institutions tended to be more 
differentiated in 1990 than they had been in 1 980. Fight 
of the 14 colleges and universities were using a letter- 
grade system with pluses and minuses in 1990, a system 
used by only 4 of these institutions in 1980. At one uni- 
versity the change to a more differentiated system had 
been mandated by the administration as part of an ef- 
fort to combat grade inflation. This change appears to 
have produced the desired results at this institution but 
it is unclear whether it has had similar effects at the 
other colleges and universities studied. In earlier re- 
search on this topic. Potter (1979) found that adding 
pluses and minuses to letter grades resulted in higher 
CiPAs than were produced by the simpler five-letter 
grading system. 

Still another area of change that may have affected 
grades is the use of student evaluations of faculty to 
make decisions about faculty retention, promotion, and 
tenure (and, less often, salaries). Flcven of the 14 col- 
leges and universities indicated that such evaluations 
were required institution-wide in 1990, while only 8 
had used such evaluations in 1980. In each case the in- 
stitution indicated that these evaluations were used in 



retention, promotion, and tenure decisions. Faculty 
questionnaires also indicated wide use of student evalu- 
ations; 99 percent of responding faculty said they were 
used in their department and more than 75 percent said 
these evaluations affected tenure, promotion, and reten- 
tion decisions as well as helping the department im- 
pro\e instruction. This extensive use of student evalua- 
tions to make major decisions about faculty, especially 
during a period when faculty positions have become in - 
creasingly difficult to find and keep, raised the question 
that some faculty might be afraid to grade too harshly 
lest they receive unfavorable student reviews and jeop- 
ardize their careers. This concern was confirmed in the 
faculty questionnaire. Nearly a third (32 percent) of re- 
sponding faculty said that, when assigning grades, they 
gave a lot or some consideration to how they would be 
evaluated by their students. This policy may contribute 
unwittingly to grade inflation. 

Faculty members were asked directly to indicate 
what they thought were the three most important rea- 
sons for any grade inflation that occurred between 1970 
and 1990. Thirty percent of the respondents said that 
the most important reason was that faculty now expect 
less of students; 54 percent said this was one of the three 
most important reasons. Other frequently reported rea- 
sons were that faculty now have less concern for main- 
taining high standards and that faculty have changed 
their grading focus away from absolute standards and 
toward student improvement. An estimate of tin* extent 
to which charges in faculty grading philosophy may 
have occurred can be determined h\ the fact that 
slightly more than 40 percent of responding faculty in- 
dicated they had changed their philosophy since they 
began teaching. 

Institutional and Departmental 
Influences on Grades 

This study found that relatively few of the colleges and 
universities surveyed had prescribed grading policies. 
Of those that did, the most common policy was grading 
against specific standards. This policy was reported at 
three institutions and had not changed between 1980 
and 1990. According to the department chairs, there 
were no departmental policies regarding grading prac- 
tices. Flowever, faculty had a different perspective. Ap- 
proximately a quarter (26 percent) of responding fac- 
ulty said their department had a policy requiring 
grading against specific standards. One possible reason 
for this discrepancy is that faculty believed that institu- 
tional policies requiring grading against standards (in 
place at three institutions) were departmental policies. 



28 



3-1 



Approximately 1 1 percent of faculty reported a depart- 
mental policy concerning including attendance in the 
course grade, a figure that also suggests confusion with 
the institutional policy concerning this factor at one col- 
lege. However, while no institution or department had 
a policy about grades in honors courses, 14 percent of 
faculty believed that there was such a departmental 
policy. 

Faculty were much more likely to report that cer- 
tain grading practices were expected in their depart- 
ment. Nearly two-thirds {6$ percent) of responding fac- 
ulty said their department expected them to grade 
against specific standards. About half (51 percent) of 
faculty said their department expected that they would 
grade students relative to the overall performance of the 
class and nearly half (47* percent) said their department 
expected them to give a lower grade it students were 
late in turning in work. Between a quarter and a third 
of responding faculty indicated their departments ex- 
pected that students in honors courses would get an A 
or a B, that attendance and/or effort would be included 
in the course grade, that individual progress would be 
included in the course grade, that students would" be 
graded "on the curve," and/or that attitude and be- 
havior would be included in the course grade. Depart- 
mental chairs probably communicated such expecta- 
tions, either in discussions about grading or in 
conjunction with their review of course grades. Depart- 
mental review of grades was reported in all the less se- 
lective institutions; such review was least common in 
the more selective institutions. In addition, all the de- 
partment iiairs in less selective institutions reported 
holding informal discussions about grading. These find- 
ings suugesr that departments in the less selective insti- 
tutions are the most likely to be concerned about main- 
taining expected grading practices. 

Overview of Faculty Grading 
Philosophies and Practices 

Overall, responding faculty were most likely to say that 
they emphasized formal, objective measurement of 
knowledge and that the\ graded students in terms of 
how their achievement level compared wirh some 
absolute standard. Faculty gave the most importance to 
tests and quizzes when they assigned grades in their 
introductory courses and were most likely to use essay 
tests when assessing students in these courses. In ad- 
vanced courses, faculty tended to give more importance 
to papers and written assignments than to tests and 
quizzes. Those faculty who taught laboratory or field- 
work courses also tended to place more importance on 



papers and written assignments than on tests and 
quizzes. Faculty members resp mding to this survey be- 
lieved that the most impor«\ 4 nt purpose of grading was 
to give feedback to students. 

Departmental Variations in 
Grading 

This study confirmed what other studies have found — 
there are many significant differences across depart- 
ments in grading philosophies and practices. Faculty 
differed significantly in their attitudes about grading, 
with those in the sciences more likely to say they em- 
phasized formal objective measurement of knowledge 
than those in the humanities or in preprofessional areas 
(such as business and education). However, there were 
no significant differences across departments in the per- 
centage of faculty who said they use or preferred a 
criterion-referenced approach to grading. 

T!i ere were significant differences in the importance 
that faculty in the different departments assigned to v ar- 
ious grading factors, both for introductory and for ad- 
vanced courses. In introductory courses, faculty in busi- 
ness, chemistry, history, mathematics, and psychology 
considered tests and quizzes most important, while fac- 
ulty in education and Fnglish considered papers and 
written assignments most important. Kxplanations for 
these preferences may be associated with the nature of 
the subject or with other characteristics of the course, 
such as typical class size. Such preferences appear to be 
linked to the type of assessment used. In introductory 
courses, psychology faculty were most likely to use 
multiple-choice tests, while Knglish and history faculty 
tended to use essays and mathematics faculty used per- 
formance tests. In advanced courses, chemistry and 
mathematics faculty gave the most importance to tests 
and quizzes, while faculty in the other departments con- 
sidered papers and written assignments most important. 

The importance of tests for grades in introductory 
chemistry, history, and psychology courses may be re- 
lated, in part, to the relatively large size of introductory 
courses in these subjects. Similarly, the relatively small 
size of introductory Fnglish classes, as well as the nature 
of the subject, may explain the greater emphasis given 
to papers and written assignments. The smaller size of 
advanced-level courses may also account for the shift in 
emphasis from tests to papers and written assignments 
by faculty in business, history, and psychology. Both 
chemistry and ma them a ties, which use relatively si tuple 
grading models that emphasize tests at both the intro- 
ductory and advanced course levels, are subjects in 
which reduction of information by quantification plays 
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an important role. In contrast, education and F.nglish, 
which use more complex grading models that empha- 
size papers and written assignments at both levels, are 
subjects dealing with behaviors that are considered to 
have complex explanations that cannot be readily quan- 
tified. 

There were fewer significant differences in faculty 
opinions about the purposes of grading than in grading 
philosophies and practices; faculty in mathematics and 
history were more likely to view grades as a way of mo- 
tivating students than were faculty in education and 
psychology. 

Additional Analyses and Future 
Research 

There are several additional analyses that could be car- 
ried out with this data to provide further information 
about how the grading process varies and about how it 
may have changed. 

It would be highly desirable to analyze this data by 
institutional characteristics. As indicated in the litera- 
ture review, grades tend to be slightly less predictable in 
institutions that enroll students with a wide range of 
academic ability, once there is a correction for restric- 
tion of range; grades also may vary more in institutions 
with a very diverse curriculum. In addition, variations 
in faculty grading philosophies and practices in more 
and less selective institutions need to be explored. 

It would also be useful to compare the current 
grading philosophies and practices of faculty who say 
they have and have not changed their approach to 
grading since they began teaching. Such an analysis 
would help confirm the hypothesis that faculty are now- 
more focused on student improvement and less on ab- 
solute standards than they were in the past; it would 
also help confirm the argument that faculty are now less 
concerned with high standards. Finally, such an analysis 
might shed some light on the changing expectations that 
faculty have f' ,r students. 

The muki-level nature of these data also needs to be 
explored lurther. For example, to what extent do fac- 
ulty grading practices differ when the use of specific 
grading standards is mandated by the institution or is 
an expected practice of the department? I low do faculty 
grading approaches vary in departments that do and do 
not review the grades their faculty give; are such varia- 
tions related to the use of part-time faculty? Also, it 
would be interesting to do some correlational analyses 
of these data to determine how the different compo- 
nents are interrelated. This might provide the ground- 
work for a future study to determine the extent to 



which grades are affected by institutional, depart- 
mental, faculty, and student characteristics. 

Future studies of grading should be designed so that 
the findings can be linked to actual grade records. It 
would be especially useful if comparisons could be 
made between institutions that have shown higher and 
lower rates of grade inflation. 



Grading in the Mid-1990s 

Does this report provide any clues about whether and 
how the predictability of the CPA will change? The an- 
swer is "yes, there arc clues," but the clues are some- 
what contradictory. 

There appears to be considerable pressure on insti- 
tutions of higher education and their faculties to reduce 
what the public perceives as lax standards resulting in 
ever rising (iPAs. Institutions seem to be taking steps to 
respond to this, primarily by introducing curriculum re- 
quirements. At the departmental level, slightly more 
than half the responding faculty reported efforts to raise 
standards. There also appears to be informal pressure 
for faculty to meet certain expected departmental 
grading standards, especially at the less selective institu- 
tions in this survey. Both of these trends might be ex- 
pected to lead to lower overall (iPAs and to make the 
CiPA more predictable. 

However, other forces are at work that may have 
the opposite effect. Many institutions are using a more 
differentiated grading system. This was shown, in the 
past, to be associated with inflation of the CiPA. It is not 
entirely clear from these data whether or not greater dif- 
ferentiation is having this effect at the institutions in this 
survey that have adopted a system of letter grades with 
pluses and minuses, Another, potentially more powerful 
force for grade inflation is the increased use of student 
course evaluations to make decisions about faculty ca- 
reers. As we move through the 1990s, competition to 
enroll and retain students appears to be a matter of in- 
creasing concern in higher education. Faculty at institu- 
tions most in danger of loosing students who receive 
low grades are likely to feel increased pressure to 
modify grades sufficiently so that students will recnroll. 
These faculty members are also likely to be concerned 
about their own futures and, especially among junior 
faculty, feel it necessary to adapt their grading stan- 
dards to their students. Nearly a third of responding 
faculty said they gave some or a lot of consideration to 
the impact on student evaluations when assigning 
course grades. Nearly half of responding faculty said 
they had changed their grading philosophy since they 
began to teach. Finally, about half said that grade infla- 
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tion had occurred because faculty today expect less of 
students than they did in the past. 

Just how these competing forces will affect col- 
lege grades in the middle and late 1990s remains to be 
seen. 
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